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MAI  .1  (Invited) 
8:45am-9:15am 


HIGH  SPEED  CMOS  IMAGING 


E.  Fossum,  A.  Krymski 
Photobit  Corporation 

135  N  Los  Robles  Ave,  7th  Floor,  Pasadena,  CA,  91101 
E-mail:  fossum@photobit.com 


High  speed  imaging  has  been  undergoing  very  revolutionary  changes.  Just 
recently  CCDs,  having  spent  more  than  3  decades  of  development  and  spending 
many  hundreds  of  millions  of  dollars,  were  hard  pressed  to  reach  the  milestone 
of  250  kilo-pixels  at  1000  frames  per  second.  A  typical  high  speed  CCD  camera 
is  a  computer  rack  size  box,  filled  with  many  components:  multiple  power 
supplies,  pulse  generators  and  drivers,  ADCs  etc.,  consuming  tens  of  Watts  of 
power.  Comparatively,  high  speed  CMOS  sensors  are  just  getting  started  (the 
first  sensor  for  high  speed  was  designed  only  about  two  years  ago),  but  have 
already  exceeded  this  data  throughput  with  4,1  Mega-pixels  at  240  FPS,  and 
progress  is  continuing  so  rapidly  that  no  one  can  predict  what  will  be  possible  in 
say  two  years. 

This  paper  analyzes  basic  architectural  features  of  CMOS  digital  imagers, 
presents  the  history  of  their  development,  and  gives  examples  of  high-speed 
CMOS  APS  implementation. 

Photodiode  image  sensors  with  MOS  scanning  circuits  were  known  from  mid 
60s,  however,  they  were  not  embraced  commercially  because  of  poor 
performance  and  large  pixel  size  (for  that  time)  relative  to  that  of  the  CCD.  In  the 
early  90-s  CMOS  sensors  re-emerged  as  a  low-cost,  low-power  alternative  to 
CCDs  and  because  one  could  integrate  various  signal  and  image  processing 
blocks  such  as  amplifiers,  ADCs,  circuits  for  color  processing  and  data 
compression,  standard  TV  and  computer  I/O  interfaces  on  the  same  chip  [1], 
CMOS  technology  is  now  making  a  breakthrough  on  two  fronts:  sensors  for 
computers  and  cell  phones  on  the  low-end,  and  ultra  high  speed,  large  format 
imaging  on  the  high-end  . 

A  typical  CMOS  APS  (active  pixel  sensor)  contains  3  NMOS  transistors  in  it. 
Although  this  allows  a  very  compact  implementation,  the  sensor  lacks  parallel 
acquisition  of  the  image  data,  a  feature  often  important  in  high  speed  imaging. 
The  alternative  is  a  pixel  that  contains  an  analog  memory  and  is  called  SNAP: 
shuttered-node  active  pixel.  This  approach  will  also  be  addressed  within  this 
presentation. 

The  readout  of  all  pixels  in  one  row  is  usually  done  in  parallel.  This  parallelism 
potentially  allows  very  high  speed  of  data  readout.  An  important  issue  is  how  the 
data  is  multiplexed  into  the  output  pad  (pads).  Multiplexing  of  digital  data  (only  1 
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or  0)  is  much  simpler  than  passing  of  analog  data,  so  the  data  from  the  pixel 
must  be  digitized  prior  to  the  readout.  It  is  then  straightforward  to  read  the  digital 
data  through  a  wide  (8,  10,  or  16  bit)  bus.  In  the  presentation,  different 
architecture  readouts  and  ADCs  are  considered.  We  found  the  column-parallel 
ADC  architecture  extremely  efficient  when  implementing  large  format  high  speed 
image  sensors.  This  approach  is  feasible  for  large  format  because  the  ADCs 
occupy  a  noticeable  area,  so  it  would  not  be  as  suitable  for  realization  of  CIF- 
VGA  format  sensors  operating  at  regular  30-60  Frames/s  speed. 

Another  architectural  feature  that  allows  high  speed  operation  is  pipelining.  It  is 
essential  that  the  readout  of  the  digitized  data  is  done  at  the  same  time  as  pixel 
readout  and  A-to-D  conversion.  While  this  may  sound  trivial,  but  in  the  practical 
implementation,  care  must  be  taken  to  eliminate  the  crosstalk  between  noisy 
digital  data  paths  and  sensitive  analog  nodes. 

The  following  high  speed  CMOS  APS  were  designed  recently: 

1.  1024x1024  500  Frames/s  CMOS  APS  with  rolling  shutter  and  10  urn  pixels  [2] 

2.  2368x1728  240  Frames/s  CMOS  APS  with  rolling  shutter  and  7  urn  pixels 

3.  1280x  1024  600  frames/s  CMOS  APS  with  SNAP  12  urn  pixel. 

The  achieved  performance  will  be  outlined  in  the  presentation  and  sample 
images  captured  by  cameras  built  using  the  sensors  will  be  demonstrated. 

To  conclude,  we  considered  a  number  of  CMOS  APS  architectures  and 
demonstrated  that  CMOS  technology  is  very  suitable  for  the  realization  of  high 
speed  electronic  cameras.  Such  a  technology  yields  low  power  (~0.5  W  for  the 
sensor),  a  compactness  unachievable  with  CCDs,  and  reliability.  The  future  for 
CMOS  technology  in  high  speed  imaging  is  bright. 

References: 

1.  Fossum,  E.R.  ,  “CMOS  Image  Sensors:  Electronic  Camera  on  a  Chip”,  IEDM 
Tech  Dig.,  pp. 17-25,  Dec.  1995. 

2.  A.  Krymski  et.al,  A  High  Speed,  500  Frames/s,  1024x1024  CMOS  Active 
Pixel  Sensor”,  1999  Symp.  on  VLSI  Circuits,  pp.  137-1 38,  1999 
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MAI  .2 

9:15am-9:30am 

MULTICHANNEL  COMPUTER  GENERATED  HOLOGRAMS 

M.  A.  Araiza-Esquivel  and  S.  Guel-Sandoval 

Instituto  de  Investigation  en  Comunicacion  Optica  (IICO),  Universidad  Autonoma  de  San  Luis  Potosi 
(UASLP),  A.  Obregon  64,  San  Luis  Potosi,  S.L.P.  78000,  Mexico 

1.  INTRODUCTION 

Detour  phase  computer  generated  holograms  were  designated  more  than  30  years  ago1, 2,  however 
they  still  remain  under  research  due  to  their  interesting  characteristics.  In  the  late  years  many  researchers 
have  proposed  several  multiplexing  methods3'5,  using  detour  holograms,  where  they  have  been  able  to 
perform  an  image  reconstruction  in  various  diffraction  orders 

As  a  new  approach,  in  this  work  we  propose  handling  two  or  three  flat  partial  objects  (FPO) 
(sections  of  a  whole  flat  object),  that  will  be  stored  in  a  hologram,  in  such  a  way  that  an  image  addition  is 
carried  out  with  the  recovering  being  in  the  same  or  different  direction  and  same  diffraction  order.  The 
used  FPO’s  can  be  parts  of  a  larger  one,  so  a  direct  application  of  this  method  is  to  reunite  as  a  whole, 
several  flat  partial  images  (FPI)  from  corresponding  FPO’s. 


2.  CODIFICATION  TECHNIQUE 

Fig.  1  shows  various  cell  designs,  each  of  size  SvxSv,  with  coordinates  (mSv,  ndv)  where  up  to 
three  rectangles  are  used,  with  the  purpose  of  increasing  the  amount  of  information  codified  in  the 
hologram.  In  next  paragraph,  first  cell  characteristics  and  equations  are  given.  The  others  are  only 
mentioned,  since  their  treatment  is  similar,  however  results  are  shown  with  all  cells  included. 


Figure  1.  Cells’  structure  for  binary  holograms.  Parameters  W  and  P,  with  indices  (m,  n),  codify 
amplitude  and  phase,  respectively.  The  upper  indices  identify  FPO’s. 

Cell  in  Fig.  1(a)  allows  codifying  up  to  4  free  parameters.  Each  rectangle’s  width  is  defined: 
Wx 8v  and  W~x 8v,  and  lateral  changes  are  introduced  by  variables:  P*8vand  P~x5v,  with  constant 

tnn  wn 

height.  Our  purpose  is  to  codify  two  flat  objects:  one  ux(x,y),  with  parameters  W*„bv and  Pxn 5v,  and 

another  u_x{x,y),  with  parameters  W~x5v and  P~xbv.  Both  images  will  be  obtained  in  the  diffraction 

order  ±1 .  The  letters  -X and  X are  used  to  identify  different  flat  objects. 

The  binary  amplitude  transmittance  H  of  the  holograms  can  be  written  as: 


tf(v,-v,)=X  \rect  — 


v,  -mbv-Pxbv 
W*  8v 


v„  -  w8v 

rect 

y 

5v 

+  rect 

vx-m8v+p-n  8v 


Each  value  (m,  n)  refers  to  a  cell.  The  red  function  is  defined  only  horizontally,  since  in  the 
vertical  direction  the  rect  function  is  constant  and  equal  to  Sv  for  all  n,  so  it  can  be  neglected.  Then  at  the 
output  plane,  the  field  distribution  h(x,y)  appears  as  the  inverse  Fourier  transform  (IFT)  of  H(yx,  vy)  as: 

h{x,y) =  £ {(x5v Wx„n)sinc(xbv Wx n)exp\liTu(m  +  P*)5v]+  (xbvWmx  T^in((xbvW-x  v)exp\liTvc(m  - Pj )bv]  (2) 

m/i 

Besides  it  is  desired  that  the  complex  amplitude  h(x,  y )  describes  the  images  ux(x,y)  and 


u~x  (x,  y) .  After  some  approximation,  the  equation  above  is  reduced  to: 
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(3) 


sen(n  WL  )=AL  <p  *  =  2iiPl 

sen(nWj  )  =  Aj  cp~J  =  -2nPj 

In  this  way  it  is  possible  to  reconstruct  both  images  in  the  same  order,  as  we  can  see  next. 

3.  EXPERIMENTAL  RESULTS  AND  CONCLUSIONS 


Fig.  2  shows  different  groups  of  FPO’s  used  to  generate  several  holograms.  Results  are  shown  in 
figure  3  and  discussed  in  text  below: 


Figure  2.  Flat  partial  Objects. 


Figure  3.  Reconstructed  images,  (a)  Using  Fig.  1(a)  and  couple  of  FPO’s  Fig.  2(a);  (b)  Fig.  1(b)  and  Fig. 
2(a);  (c)  Fig.  1(a)  and  Fig.  2(b)  manipulating  some  instruction;  (d)  Fig.  1(c)  and  Fig.  2(c);  (e)  Fig.  1(d) 

and  Fig.  2(d). 

Results  show  it  is  possible  to  reconstruct  a  whole  image  of  two  or  three  FPO’s  which  are  codified 
separately  and  maintained  independent  during  codification.  The  hologram  thus  works  as  a  multiplexing 
system  capable  of  encoding  two  or  three  separated  channels  of  information. 

The  method  has  been  designated  as:  image  addition  by  multiplexed  computer  generated 
holograms  (IAMCGH).  This  IAMCGH  method  can  be  considered,  as  a  different  approach  for  a  system  to 
codify  information  by  sections,  to  be  recuperated  latter  as  a  whole.  The  cell  dimensions  limit  the 
maximum  number  of  rectangles  used  and  consequently  the  number  of  FPO’s  that  can  be 
codified.  For  the  time  being  up  to  three  FPO’s  have  been  recorded.  An  additional  characteristic 
of  the  IAMCGH  method  is  that  it  makes  possible  to  recuperate  the  FPO’s  images  in  the  same  or 
different  directions.  The  authors  acknowledge  Prof.  A.  W.  Lohmann  for  their  suggestions  and 
encouraging  comments.  Besides,  one  of  the  authors  (MAE)  acknowledges  CONACYT  (Mexico) 
for  providing  financial  partial  support  for  this  work.  We  also  acknowledge  J.  Nieto  for  providing 
technical  support. 

4.  REFERENCES 

1.  B.  R.  Brown  and  A.  W.  Lohmann,  “Complex  spatial  filtering  with  binary  masks”,  Appl.  Opt.  5,  967- 
969(1966). 

2.  A.  W.  Lohmann  and  D.  P.  Paris,  "Binary  Fraunhofer  Holograms  Generated  by  Computer”,  Appl. 
Opt.  6,  1739-1748(1967). 

3.  D.  Mendlovic  and  I.  Kiryuschev,  “Two-channel  computer-generated  hologram  and  its  application  for 
optical  correlation”,  Optics  Commun.  116,  322-325  (1995). 

4.  D.  Mendlovic,  “Double-multiplexed  computer-generated  holograms”,  Appl.  Opt.  35,  3887-3890 
(1996). 

5.  R.  ChaoHong,  Z.  Jin  and  G.  WenQi,  “Four-channel  self-focus  computer  generated  hologram”,  Appl. 
Opt.  36,  8844-8847  (1997). 
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9:30am-9:45am 


Polarisation-induced  beating  in  Spatial  Light  Modulator  Optical  Switches 

Hsiao-Hua  Chu,  Melanie  J.  Holmes 

Department  of  Electronic  Engineering,  King’s  College  London,  Strand,  London  WC2R  2LS,  U.K. 

E-mails:  hsiao-hua.  chu@kcl.  ac.  uk,  melanie.j.  holmes@kcl.  ac.  uk 

Abstract  We  report,  for  the  first  time,  results  of  a  foil  polarisation  analysis  of  SLM-based 
switches,  taking  into  account  all  diffraction  orders.  Although  the  switch  efficiency  is  dominated  by  a 
polarisation-independent  term,  we  discover  polarisation  dependent  beating  tffects  with  the  residual 
zero  order. 

Introduction  Spatial  light  modulators  (SLMs)  are  an  attractive  option  for  beam-steering  optical 
switches  due  to  the  rapid  response  time  of  the  liquid  crystal,  the  potential  for  multicasting,  and  the  lack 
of  moving  parts.  The  aim  of  our  research  is  to  design  a  beam-steering  terminal  for  network  protection 
in  submarine  networks.  Previous  results  have  been  reported  in  [1]. 

Polarisation  dependent  analysis  The  basic  operation  of  such  a  switch  has  been  reported  in  [2], 
and  uses  phase  modulation  applied  by  a  ferroelectric  liquid  crystal  SLM  to  steer  an  incident  beam  into 
the  selected  output  waveguide.  In  practice,  the  effect  of  the  phase  modulation  is  to  diffract  the  beam 
incident  on  the  SLM  into  a  number  of  reflected  beams,  each  corresponding  to  a  different  diffraction 
order  of  the  phase  modulation.  Only  one  of  these  diffraction  orders  is  created  intentionally;  the  others 
are  unwanted.  The  selected  diffraction  order  carries  the  message  stream  into  the  required  output 
waveguide,  while  the  unwanted  diffraction  orders  lead  to  crosstalk  and  backreflection.  On  arriving  at 
the  launch  end  of  the  waveguide,  the  tails  of  the  beams  in  the  unwanted  diffraction  orders  will  couple 
into  their  nearest  port,  and  lead  to  crosstalk. 

The  beam  leaving  the  fibre  is  approximated  to  be  Gaussian,  and  for  arbitrary  elliptical 
polarisation  field,  is  therefore  expressed  as  Ein(x,y)  =  (Exx  +  Eye‘s  y)exp[-(x2  +y2)/w,j  ] ,  where  x 
and  y  are  unit  vectors,  defined  to  be  parallel  and  perpendicular,  respectively,  to  the  axis  of  the  FLC 

cone.  Previously,  the  switching  efficiency  into  first  and  higher  diffraction  orders  has  been  shown  to  be 
polarisation-independent  [3],  with  the  result  of  non-ideal  SLM  cell  thickness  and  tilt  angle  being  the 
creation  of  a  residual  zero  order.  We  have  carried  out  a  full  polarisation  analysis  of  the  coupling 
efficiency  and  discovered  polarisation-dependent  beating  effects.  The  field  incident  on  the  SLM  is 
proportional  to  the  Fourier  transform  of  the  electric  field  and  may  be  calculated  from  diffraction 
theory.  The  beam  incident  on  the  SLM  passes  through  a  cover  glass,  a  layer  of  birefringent  liquid 
crystal,  is  reflected  from  a  mirror  pixel  and  then  passes  back  through  the  liquid  crystal  and  cover  glass. 
This  propagation  through  the  SLM  may  be  represented  in  the  form  of  a  Jones  matrix.  Therefore,  the 
field  leaving  the  SLM  depends  on  the  FLC  tilt  angle,  6 ,  and  the  cell  property  (retardation)  r ,  where 
T  =  2nAntlX ;  t  is  the  thickness  of  the  liquid  crystal.  An  is  the  refractive  index  difference  between 
the  fast  and  slow  axis  of  the  liquid  crystal. 

In  normal  operation,  the  main  terms  in  our  result  are  that  the  coupling  efficiency  is  given  by: 
T}  =  rj0+Tjl  +rib ,  where  7]0  is  the  expected  coupling  efficiency  of  the  power  in  the  first  diffraction 
order  (as  in  [3]):  tj0  =  af  (a\2  +  bl2)l  4,  where  a  1  and  b\  are  the  real  and  imaginary  first-order 
Fourier  series  coefficients  of  the  periodic  phase  modulation  applied  by  the  SLM,  and  a,  =sin  26  sin  T 
is  the  modulating  coefficient.  7],  represents  the  power  coupled  in  from  the  evanescent  tail  of  the 
beam  directed  into  the  residual  zero  order:  rj,  =  (a2r  +al)q2  ,  where  a0i  =  cos 20 sin T  and 
a0r  =  cosT  are  the  non-modulating  real  and  imaginary  coefficients  caused  by  the  residual  zero  order, 
and  q  =  exp[-d2  /(2wjj)]  and  we  have  set  the  output  waveguide  position,  d ,  to  be  optimised  to 
receive  a  first  order.  The  polarisation-dependent  term,  r]b ,  is  given  by: 
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2  KxKy 

Vb  =  Ex2  +  Ey2  g/(g0f sin  5  -«0r  cos5)[fel?  +  (60  +  b2)q2  ] 

where  60 ,  61  and  62 ,  are  the  zeroth,  first  and  second  order  imaginary  Fourier  series  coefficients 
of  the  periodic  phase  modulation  applied  to  the  SLM.  In  steady-state  operation  these  zero  and  second 
orders  would  be  very  weak,  so  this  term  would  be  negligible.  However,  they  may  become  significant 
intermittently,  as  the  hologram  is  scrolled  across  the  SLM,  in  order  to  maintain  DC  balance  [2],  What 
is  surprising  about  this  result  (and  we  have  checked  it  several  times)  is  that  the  result  depends  only  on 
the  imaginary  Fourier  series  coefficients.  In  principle,  then,  the  effect  could  be  avoided  by  making  the 
phase  modulation  have  even  symmetry  about  the  centre  of  the  hologram.  However,  in  practice  the 
hologram  pattern  must  be  scrolled  to  maintain  DC  balancing  of  the  pixels.  As  the  pattern  is  scrolled 
across  then  |al|  will  fall  to  zero  and  |6l|  will  become  maximised,  at  a  value  of  up  to  2/tc  for  the  most 
efficient  hologram  patterns. 

The  worst-case  fractional  polarisation-dependent 
fluctuation,  T)h  /t]0 ,  is  plotted  in  the  >figure,  for  a  tilt 

angle,  0,  of  36°,  and  for  values  of  retardation  T  that  are 
ideal,  and  in  error  by  20%  to  40%  of  the  ideal  value.  The 
results  are  plotted  as  a  function  of  the  waveguide  offset 
from  the  output  position  of  the  zero  order,  d ,  and 
assuming  a  spot-size,  w0  of  5.1  pm.  This  figure  has  two 
sets  of  curves  grouped  together:  the  upper  group  (labelled 
PI)  shows  the  polarisation-dependent  power  fluctuations, 
while  the  lower  group  of  curves  in  the  figure  (labelled  RZ) 
shows  the  expected  crosstalk  from  the  residual  zero 
order,  given  by  17,  /r]0  ■  These  results  show  that  for  significant  errors  in  cell  thickness,  and  for  offsets 

of  5pm  or  more  away  from  the  output  position  of  the  residual  zero  order,  the  polarisation-dependent 
fluctuation  becomes  larger  than  the  expected  power  in  the  received  signal. 

It  may  be  shown  that  the  worst-case  polarisation-dependent  fluctuation  is  given  by 
{f]b ! Vo ) worst-case  =  ■  Hence  when  the  expected  crosstalk  from  the  residual  zero-order  is 

-30  dB,  the  polarisation-dependent  fluctuation  would  be  -12  dB.  Therefore,  in  order  to  avoid 
significant  temporal  fluctuations  as  the  polarisation  state  changes,  and  as  the  hologram  pattern  is 
scrolled  across  the  SLM,  these  effects  should  be  taken  into  account  when  beam-steering  SLM-based 
switches  are  designed  For  an  ideal  SLM,  the  residual  zero  order  would  be  absent,  and  so  the 
polarisation-dependent  beating  would  also  disappear. 

Conclusions  We  report,  for  the  first  time,  results  of  a  full  polarisation  analysis  of  the  switch,  taking 
into  account  all  diffraction  orders.  We  discover  a  strong  polarisation  dependent  beating  between  the 
first  order  and  residual  zero  order,  and  weaker  polarisation-dependent  beating  with  the  zero  and 
second  orders  from  the  hologram  (phase  modulation)  pattern.  This  beating  results  in  significant  power 
fluctuations  in  output  channels  located  close  to  the  residual  zero  order,  and  should  be  taken  into 
account  in  future  designs  of  beam-steering  optical  switches  using  ferroelectric  SLMs. 
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In  view  of  the  increasing  demand  for  greater  communication  capacity,  the  bit  rate  for  long  haul 
communication  has  been  rising  alarmingly  over  the  years.  This  has  led  to  explosive  growth  for  high  speed,  large 
bandwidth  light-wave  technology  in  carrying  the  vast  amounts  of  data  reliably,  efficiently  and  cost  effectively  in 
long  haul  communication  systems  [1,2,3],  Consequently,  this  has  motivated  the  optical  communication  technology 
moving  towards  monolithic  integration  of  light  source  and  modulator  devices  operating  at  1.55  um,  as  the 
integration  provides  advantages  such  as  improved  performance,  reliability  and  cost  reduction  [4],  Integrated 
modulator  with  semiconductor  lasers  seems  to  be  a  potential  candidate  in  high-density  wavelength  division 
multiplexing  systems  [5],  One  of  the  main  advantages  of  the  integrated  modulator  is  to  reduce  frequency  chirping, 
resulted  from  the  direct  modulation  of  lasers,  hence  expanding  the  system  bandwidth  [5,6], 

In  this  paper,  10-channel  monolithic  multiple  wavelength  electro-absorption  (EA)  modulators  have  been 
fabricated  onto  a  single  chip  using  a  one-step  quantum  well  intermixing  technique  based  on  arsenic  (As**)  ion 
implantation  through  a  graded  thickness  silicon  dioxide  mask  [7],  Each  individual  modulator  has  a  dimension  of 
400  x  500  pm2,  50  pm  width  of  active  window,  500  pm  cavity  length  and  20  pm  width  of  isolation  trench.  The  EA 
intensity  modulators  were  characterized  at  room  temperature  using  end-fire-coupling  technique.  Samples  used  in 
this  study  had  a  InGaAs/InGaAsP  multiple  quantum  well  (MQW  -  five  quantum  wells)  laser  structure  grown  by 
metal-organic  vapor  phase  epitaxy  and  were  of  the  form  of  a  stepped  graded  index  (GRIN)  hetero-structure. 

Photo-current  measurements  from  0.0  V  to  9.0  V  bias  were  performed  by  end-fire  coupling  a  single-mode 
laser  source  into  the  quantum-well  active  region.  A  tunable  semiconductor  laser,  with  wavelength  tunability  ranging 
from  1500  nm  to  1580  nm,  was  used  in  this  measurement.  Figure  1  shows  the  photo-current  absorption  for  the  EA 
modulators  with  wavelength,  which  have  undergone  intermixing  that  resulted  blue  wavelength  shifting  of  0  nm 
(channel  1),  24  nm  (channel  6),  33  nm  (channel  7)  and  46  nm  channel  (8),  respectively.  Note  that  the  band-edge  of 
channels  9  and  10  cannot  be  determined  due  to  the  limitation  of  the  tunable  laser  source.  Figure  2  shows  the  photo¬ 
current  spectra  of  the  intermixed  modulator  (channel  6)  under  0.0  -  9.0  V  reverse  bias.  Figures  1  and  2  resemble 
optical  absorption  profile  for  quantum  well  structure.  From  Figure  1,  it  can  be  observed  that  the  photo-current 
absorption  curves  have  been  blue-shifted  to  different  degrees  across  the  single  chip.  Only  band-edges  from  6,  out  of 
10,  channels  have  been  measured.  The  band-edges  of  the  quantum  well  are  approximately  found  to  be  1558  nm 
(channel  1),  1556  nm  (channel  2),  1553  nm  (channel  3),  1534  nm  (channel  6),  1525  nm  (channel  7)  and  1512  nm 
(channel  8). 

Although  there  is  no  clear  exciton  peak  observed,  behaviors  relating  to  the  light  and  heavy  holes  transitions 
are  still  relatively  distinguishable  (Figure  1).  For  the  intermixed  samples,  these  behaviors  become  more  poorly 
defined  as  the  degree  of  intermixing  increases,  i.e.  as  the  quantum  well  profile  becomes  parabolic  after  intermixing. 
Figure  2  shows  that  the  absorption  increases  to  longer  wavelengths  in  the  consequence  of  the  broadening  of  the 
absorption  edge,  due  to  the  Franz-Keldysh  (FK)  effect.  Since  the  exciton  peaks  under  0  V  bias  (Figure  1)  are  not 
clearly  resolved,  it  is  not  sure  whether  there  is  any  quantum-confined  Stark  effect  (QCSE)  [8],  However,  it  is  still 
believed  that  good  intermixed  and  un-intermixed  EA  modulator  could  be  fabricated  using  this  technique. 

The  intermixed  EA  modulator  at  channel  6  (ie  band-edge  of  1534  nm)  was  then  reverse  biased  to  assess  the 
modulation  depth  and  the  voltage  swing.  The  tunable  laser  source  was  set  to  1 555  nm,  i.e.  below  the  absorption 
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edge,  for  this  measurement.  A  modulation  depth  of  as  high  as  -8  dB  with  voltage  swing  of  -7.5  V  has  been 
observed. 


Figure  1  Photo-current  absorption  for  modulators, 
with  energy  band-gap  shifted  by  0  nm  (channel  1), 
24  nm  (channel  6),  33  nm  (channel  7)  and  46  nm 
(channel  8)  under  no  bias. 


Figure  2  Photo-current  absorption  for 
modulator,  with  energy  band-gap  shifted  by 
24nm  (channel  6)  under  0.0  -  9.0  V  bias. 


Multiple  wavelength  selective  channel  electro-absorption  (EA)  intensity  modulators  have  been  successfully 
fabricated  on  the  single  chip  using  a  one-step  QW1  process.  A  modulation  depth  of  as  high  as  -8  dB  with  voltage 
swing  of -7.5  V  has  been  measured. 
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Abstract 

Recently  we  have  demonstrated  a  compact  current 
input  oversampling  modulator  for  a  scalable  high  frame 
rate  focal  plane  arrays  [1]  that  enable  frame  rates  over 
than  lOOkfps  operating  in  continuous  imaging  mode.  All 
the  circuits  including  data  lines  and  detectors  were  laid- 
out  to  fit  into  125umX125um  space  using  0.8um  CMOS 
technology.  This  compact  converter  also  provides 
improvement  in  noise  filtering  performance  over  imagers 
that  use  capacitance  at  each  pixel  to  convert  detector 
current  to  voltage.  This  technique  may  allow  noisy 
detectors  to  be  used  at  high  temperatures  with  good 
image  quality. 

1.  Introduction 

Current  imaging  systems  implemented  using  charge- 
coupled  device  (CCD)  technology  with  off-chip  analog-to- 
digital  data  converters  are  limited  in  the  amount  of 
detector  noise  they  can  filter  this  causes  problems  with 
noise  at  high  frame  rates  [2]  or  with  noisy  detectors  (e.g. 
IR  detectors)  Focal-plane-array  (FPA)  technology  [3]  can 
reduce  the  noise  added  to  the  detector  noise  at  each  pixel 
by  eliminating  long  analog  signal  paths,  however  if  a 
capacitor  is  used  to  convert  detector  current  to  voltage,  or 
a  charge  packet,  at  each  pixel,  this  capacitor  “samples”  the 
noise  in  the  detector  before  each  readout  permanently 
embedding  it  with  the  signal. 

Recently  to  overcome  the  speed  limitations  of  the 
conventional  readout  systems,  we  have  developed  frilly 
parallel  readout  systems  which  need  a  compact  ADC  for 
each  pixel  [4][5].  Due  to  the  small  Si  area  available,  a 
compact  ADC  with  high  speed  and  large  dynamic  range  is 
necessary.  A  current  input  oversampling  modulator  was 
found  to  be  a  very  compact  candidate  for  this  imaging 
application  because  it  is  immune  to  component  mismatch, 
and  through  subsequent  digital  filtering  of  the  output 
signals,  a  tradeoff  between  dynamic  range  and  frame  rate 
can  be  achieved.  In  addition,  the  digital  filtering  can 
remove  high  frequency  noise  from  the  image  data, 
resulting  in  an  improved  signal  to  noise  ratio  for  the  ADC- 
based  system. 


2.  Current  input  oversampling  modulator 

Modem  short-channel  CMOS  processes  offer  a  speed 
performance  which  far  exceeds  the  requirements  of  most 
imaging  systems.  Since  shorter  channel  lengths  will  be 
available  in  the  future,  this  speed  advantage  will  be  further 
improved;  however,  accuracy  and  component  matching 
are  expected  to  become  worse.  It  could  be  a  serious 
problem  for  the  fully  parallel  FPA  readout  system  because 
there  are  thousands  of  ADCs  working  simultaneously,  and 
needing  good  uniformity  to  get  a  good  image.  Hence,  it  is 
desirable  to  trade  off  speed  for  accuracy,  and  obtain 
accuracy  advantages  at  the  cost  of  speed  reductions  in  the 
focal-plane-array  ADCs.  One  of  the  ADC’s  which  trade 
off  speed  for  accuracy  is  oversampling  ADC  which  is 
designed  in  this  paper 

3.1  Circuit  and  simulation  results 

Figure  1  shows  the  schematic  of  the  combined 
modulator  circuits.  The  output  of  the  current  buffer  is 
connected  to  the  D/A  converter  and  integrator.  The 
current  integrator  is  implemented  with  a  capacitor.  The 
last  stage  is  a  comparator,  which  compares  the  integrator 
voltage  and  reference  voltage  then  makes  a  one  bit  output 
data  stream  [6].  The  comparator  output  is  also  fed  back  to 
the  DAC  to  control  the  feedback  current  which  makes  the 
comparator  output  average  track  the  input  value. 


Figure  1.  Current  input  oversampling  modulator 

Figure  2  shows  SPICE  simulation  results  with  50kHz 
2uA  sinusoidal  input.  Figure  2(a)  shows  the  integrator 
voltage  variation  and  the  modulator  output.  The  power 
density  function  (PDF)  of  the  output  code  is  shown  in 
Figure  2(b).  Since  this  spectrum  is  of  a  digital  1  bit 
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signal,  the  plot  of  the  PDF  shows  that  most  of  the 
digitization  noise  has  been  shifted  to  frequencies  higher 
than  the  signal  frequency.  This  will  occur  provided  that 
the  ADC  operating  frequency  is  much  higher  than  the 
signal  frequency  (oversampling).  Thus  we  can  digitized 
the  detector  current,  and  almost  arbitrarily  reduce  the 
amount  of  noise  added  in  the  digitization  process  by 
oversampling  at  higher  and  higher  rates  and  then  filtering 
out  the  high  frequency  noise  with  a  DSP  process  (such  as 
the  FFT  used  to  obtain  Figure  2(b)). 


Tf 


Intejrtoroiflpiitvohgt 
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'■  ModiSjlor  output 


(a)  Time  domain 


Frtqutocy  1 

(b)  Frequency  domain 


Figure  2.  Simulation  results 


Frequency 

x  105 

(a)  Test  result 


x  10s 


(b)  Simulation  result 


Figure  4.  PDF  responses  of  the  modulator 

There  are  many  pattern  noises  in  the  ideal  simulation 
which  can  be  removed  by  adding  a  white  noise.  Test 
results  have  less  pattern  noise  because  there  are  many 
noise  sources  which  dither  the  input  signal  [7],  From  the 
simulation  and  test  results,  it  is  not  possible  to  distinguish 
the  system  noise  from  the  quantization  noise.  Which 
means  at  an  oversampling  ration  of  4096  no  noticeable 
noise  is  added  by  our  hardware.  Thus  only  detector  noise 
in  the  signal  band  would  remain  after  filtering. 


3.3  Fabrication  and  Test  Results 

Figure  3  shows  a  layout  and  photomicrograph  of  the 
modulator  circuits  for  FPA  pixels.  An  8x8  focal-plane- 
array  system  was  tested  using  a  100  MFIz  system  clock 
which  means  each  pixel  operates  at  1 .56MHz. 


(a)  Layout  (b)  Photomicrograhp 

Figure  3.  Modulator  Layout  and  photomicrograph 

Figure  4(a)  shows  the  frequency  response  of  the 
system.  The  noise  is  modulated  and  the  in-band  noise  is 
decreased.  Figure  4(b)  shows  a  purely  simulated  result 
from  the  Matlab  for  comparison. 
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Abstract 

The  design,  characterization  and  evaluation  of  silicon 
photodetectors  suitable  for  smart  pixel  based 
applications  are  presented.  Several  different  CMOS 
based  photodetector  structures  are  presented  including 
p-n  junction  detectors  and  bipolar  phototransistors. 

Summary 

Photodetectors  a  fundamental  component  necessary  for 
the  implementation  of  photonic  information  processing 
systems.  [1-2]  In  addition  to  the  ability  to  detect  optical 
signals,  it  is  often  desirable  to  have  a  photodetector 
structure  that  can  monolithically  integrated  with  CMOS 
logic  circuitry.  [3]  In  this  paper  we  report  on  the  design, 
characterization  and  performance  evaluation  of  silicon 
photodetectors  that  can  be  fabricated  within  the 
technology  constraints  of  a  standard  1.2nm  CMOS 
fabrication  process. 

PHOTODETECTOR  DESIGN 

In  this  study,  p-n  junction  photodiodes  are  implemented 
using  a  P+  diffusion  to  N-well  and  N-well  to  P-type 
substrate  resulting  in  a  very  fast  shallow  p-n  photodiode 
and  a  slow  n-p  photodiodes  respectively.  The 
phototransistor  is  implemented  using  a  P+  diffusion  in 
an  N-well  to  form  the  emitter  and  base  regions.  The  P- 
type  substrate(which  is  grounded  in  the  n-well  CMOS 
process)  acts  as  the  lightly  doped  collector  region. 
Figure  1  shows  a  cross-sectional  view  of  a  typical 
photodetector  structure. 


Figure  2  Cross-section  of  a  typical  photodetector. 


PHOTODETECTOR  PERFORMANCE 

Simulation  results  indicate  that  the  dark  currents  for  the 
photo  diodes  and  the  phototransistor  are  very  low  (on 
the  order  of  pico  amps).  Both  photodiode  and 
phototransistor  configurations  can  be  biased  over  the 
range  of  bias  potentials  common  in  digital  logic  circuits. 
Placing  the  p  side  at  a  lower  potential  than  the  n  side 
biases  the  photodiodes.  This  is  relatively 
straightforward  for  most  circuit  designs  especially  for 
the  well  to  substrate  diode  where  the  p-substrate  is 
normally  tied  to  the  lowest  bias  potential  on  the  chip.  In 
addition  to  using  the  photodiodes  independently,  it  is 
possible  to  combined  the  operation  of  both  photodiodes 
by  biasing  the  n-well  to  positive  potential  and  grounding 
the  P+  diffusion  and  the  P-substrate. 

The  phototransistor  is  biased  in  a  floating  base 
configuration.  In  this  configuration,  transistor  action  in 
the  device  amplifies  the  photogenerated  current,  which 
acts  as  a  base  current  in  the  transistor  structure. 

Figure  2  shows  the  simulated  photocurrent  as  a  function 
of  optical  intensity  for  each  individual  photodetector 
illuminated  with  a  600nm  light  source.  As  in  the  figure, 
transistor  action  increase  the  photocurrent  in  the  PNP 
phototransistor  by  a  factor  of  ~  100. 


Photo  currents  for  different  models  at  GOOnm 
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Figure  1.  Simulated  photocurrents  as  a  function  optical 
intensity  for  600  nm  light. 
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Figure  3.  Simulated  Spectral  Responsivity  as  a  function 
of  wavelength  for  the  photodiode  structures. 

As  shown  in  figure  3,  the  spectral  responsivity  of  the 
well  to  substrate  photodiode  increases  as  the  wavelength 
is  increased  from  X,=300nm  to  650nm  where  it  reaches  a 
maximum  at  -0.27A/W.  As  wavelength  is  further 
increased  beyond  675  nm,  the  responsivity  of  the 
photodiode  drops  due  to  a  decrease  in  absorption  in  the 
active  (space  charge)  region  of  the  photodiode. 

As  shown  in  the  figure,  the  spectral  responsivity  of  the 
P+  to  well  photodiode  is  significantly  lower  than  the 
well  to  substrate  photodiode.  This  is  due  to  the  shallow 
nature  of  the  pn  junction  (i.e.  as  the  wavelength 
increases,  a  larger  fraction  of  the  light  is  absorbed  deep 
in  the  substrate  where  it  can  not  be  collected  by  the  P+ 
to  Well  diode).  Thus,  the  P+  to  Well  photodiode  has  a 
maximum  responsivity  at  450nm. 

Simulated  transient  analysis  of  these  photodetectors 
shows  that  the  P+  to  well  photodiode  is  very  fast  with 
very  small  rise  and  fall  times  (on  the  order  of  pico 
seconds).  The  rise  time  of  the  well  to  substrate  and  the 
combined  photodiodes  is  very  small  in  the  order  of  pico 
seconds  but  the  fall  time  is  fairly  large  around  85ns. 
This  is  explained  by  the  long  lifetime  for 
photogenerated  carriers  deep  inside  the  substrate.  The 
rise  time  of  the  phototransistor  is  around  30|is  and  fall 
time  75|is.  The  fall  time  is  large  compared  to  the  rise 
time  as  the  collector  is  the  large  substrate  and  hence 
carrier  lifetime  affects  the  transistor  shut  off. 

Current  Status 

Results  shown  in  this  paper  are  simulated  results 
generated  with  the  commercial  2d  material/device 
simulation  package  MEDICI.  Structures  similar  to 
those  simulated  have  been  produced  in  a  CMOS  layout. 
These  test  structures  have  been  sent  for  fabrication 
through  the  MOSIS  foundry  service.  Devices  are  due 
back  for  testing  in  April  2000.  Based  on  the  evaluation 
of  these  test  device,  comparison  between  simulation 


results  and  actual  device  performance  will  be  presented 
during  the  summer  of  2000. 

CONCLUSIONS 

In  this  paper  we  have  demonstrated  and  characterized 
photodetectors  suitable  for  smart  pixel  applications. 
Based  on  1)  the  compatibility  with  CMOS  fabrication 
processes,  2)  the  ability  to  detect  small  power  optical 
signals  3)  the  ability  to  directly  interface  with  the 
conventional  electronic  logic  circuits,  we  believe  that 
these  photodetectors  can  be  successfully  used  in  the 
designed  photonic  information  processing  systems 
based  on  photonic  VLSI  device  technologies. 
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Abstract:  Frequency  locking  and  tracking  of  a  micromachined  tunable  VCSEL  to  an  external 
spectrometer  over  a  7  nm  range  is  demonstrated.  Frequency  locking  to  channels  in  an  arrayed 
waveguide  grating  is  needed  for  WDM  systems. 


1.  Introduction 

Wavelength  division  multiplexing  is  vastly  increasing  the  capacity  of  installed  fiber  and  increasing  the  market  for 
fiber  optic  communications  systems.  However  to  enable  fiber  -to-the-desktop  and  fiber -to-the-home  a  low-cost 
integrated  wavelength  tunable  transceiver  must  be  available.  Arrayed  waveguide  gratings  (AWG)  provide  the 
integrated  multichannel  wavelength  selective  filter  and  micromachined  vertical  cavity  photonic  devices  provide  the 
tunable  sources  and  detectors  needed  for  a  WDM  tra  nsceiver.  The  WDM  transceiver  needs  to  be  able  to  transmit 
and  receive  on  any  wavelength,  which  requires  a  method  of  frequency  locking  the  tunable  devices  to  the  passband  of 
a  channel  in  the  AWG. 

Using  our  micromachined  tunable  vertical  cavity  lasers  (MTV  CSELs)  [1  -3]  we  have  demonstrated  frequency 
locking  and  tracking  to  an  external  spectrometer  over  a  7  nm  range.  The  tuning  range  is  limited  by  the  voltage 
required  to  tune  the  membrane  exceeding  the  output  voltage  range  of  the  op  -amps  used  in  the  frequency  locked  loop 
(FLL)  electronics.  The  current  devices  were  made  in  the  optimized  process  so  their  tuning  voltage  is  abnormally 
high  and  a  new  process  run  would  allow  the  tuning  range  to  be  limited  by  the  operation  of  the  MTVCSEL  ~30  nm. 

2.  Frequency  Locking 

The  method  for  frequency  locking  to  the  slope  of  the  wavelength  passband  is  shown  in  figure  1 . 


Power 

Through 

Dragono 


Laser 

Tuning 

Voltage 


Dragooe 

Diode 

Voltage 


Multiple 

Voltage 


LowPaaa 

Voltage 


Wavelength 

Time 

Wavelength 

Time 

KV- 

Wavelength 

Time 

Kv 

IW- 

|/N/\ 

[V>Ar 

Fig.  1 .  Concept  for  frequency  locking  to  a  wavelength  passband 


A  small  sinusoidal  dither  signal  is  added  to  the  DC  bias  of  the  tuning  membrane  and  causes  a  slight  variation  in 
the  wavelength  of  the  laser,  which  is  converted  to  an  intensity  variation  by  an  external  wavelength  filter.  The 
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intensity  variation  is  measured  with  a  photodiode  and  multiplied  by  the  original  sinusoid  in  o  rder  to  detect  the 
relative  phase,  which  is  set  by  the  local  slope  of  the  wavelength  passband.  Feedback  with  an  integrator  can  then 
follow  the  slope  to  the  peak  of  the  passband.  A  block  diagram  of  the  FLL  system  is  shown  in  figure  2. 


In  order  for  the  system  to  be  stable  under  all  laser  modulation  schemes  and  changes  in  the  laser  output  power  and 
coupling  efficiency  versus  tuning  the  FLL  uses  adaptive  gain.  The  basic  design  is  a  form  of  th  e  externally  excited 
adaptive  loop,  which  has  been  analyzed  by  Horowitz  [4].  The  current  design  is  stable  for  30  dB  of  gain  variation. 

3.  Measurements 

After  fabricating  the  MTVCSELs  a  custom  built  on  wafer  probing  system  is  used  to  measure  their  lasing  a  nd  tuning 
characteristics.  A  microscope  objective  collimates  the  backside  laser  emission,  which  is  routed  to  a  beamsplitter  and 
then  to  a  fiber  coupler  and  the  spectrometer.  The  fiber  -coupled  emission  is  connected  to  either  an  optical  spectrum 
analyzer  or  a  scanning  Fabry -Perot  to  measure  the  output  spectrum. 

The  MTVCSEL  tuning  voltage  is  shown  in  figure  3  for  different  spectrometer  settings  when  the  system  is 
frequency  locked. 


Fig.  3.  VCSEL  tuning  voltage  versus  time  for  different  spectrometer  settings 

The  amplitude  of  the  dither  signal  is  large  in  this  example  because  the  conversion  gain  of  the  multiplier  is  too 
low.  Increasing  the  amplitude  of  the  reference  sinusoidal  input  will  correct  this  and  allow  the  dither  amplitude  to  be 
decreased,  which  will  clean  up  the  optical  spectrum  which  is  presently  about  600  GHz  (2  nm)  wide.  The  excessive 
width  is  likely  caused  by  the  device  going  into  multimode  operation  due  to  excessive  excitation  of  the  membrane. 

The  typical  FWHM  of  th  e  laser  linewidth  is  about  3  GHz  (10  pm)  when  the  membrane  bias  is  not  modulated. 
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Low-cost  silicon  photoreceivers  are  useful  in  telecommunications-  and  optical  interconnection- 
systems.  CMOS  receivers  are  especially  attractive  because  of  their  low  power  and  ease  of  integration  with 
CMOS- VLSI  systems.  1-Gb/s  monolithic  CMOS  receivers  integrating  silicon  photodiodes  (PDs)  in  the 
conventional  CMOS  process  have  been  developed  [1][2].  Hybrid  receivers  integrating  III-V  PDs  with 
CMOS  have  also  been  fabricated  using  either  the  flip-chip  bonding  technique  [3]  or  the  polyimide 
wafer-bonding  technique  we  developed  [4],  However,  the  monolithic  receivers  operate  at  only  0.85-pm 
wavelength  and  require  a  high  supply  voltage  of  over  10  V  for  the  PD.  All  of  the  reported  hybrid 
receivers  also  operate  at  only  0.85  pm,  despite  the  fact  that  hybrid  integration  allows  a  variety  of  device 
combinations.  Furthermore,  none  of  the  reported  CMOS  photoreceivers  have  achieved  a  sensitivity 
below  -10  dBm  at  lGb/s. 

We  have  developed  wafer-bonded  hybrid  receiver  using  0.5-pm  CMOS  that  operates  with  a  single 
3.3-V  supply  voltage.  The  receiver  circuit  is  small  and  simple,  has  high  sensitivity  and  a  broad 
bandwidth,  and  operates  at  either  0.85  or  1 .55  pm  due  to  the  direct  attachment  of  the  III-V  PD  without 
parasitic  capacitance. 

Figure  1  is  a  circuit  diagram  of  the  CMOS  receiver,  which  consists  of  the  III-V  PD,  two  amplifier 
stages,  and  an  off-chip  driver.  The  first  stage  is  a  transimpedance  amplifier  made  of  a  simple  inverter 
and  a  feedback  resistor  (Rf).  Because  the  parasitic  capacitance  on  the  PD  is  below  10  fF  [4],  the  gate 
width  of  the  first-stage  transistor  can  be  small,  and  R,  can  be  set  as  high  as  5kQ  when  the  front-end  of 
the  receiver  has  a  bandwidth  of  over  1  GHz.  Such  a  large  Rf  leads  to  not  only  low  noise,  i.e.  high 
sensitivity,  but  also  high  gain.  The  small  size  and  high  gain  of  the  first  stage  allows  the  use  of  a  second- 
stage  voltage  amplifier  made  also  of  a  simple,  small-sized  inverter.  The  off-chip  driver  consists  of  two 
inverters,  both  with  a  feedback  resistor.  The  driver  is  designed  to  have  a  bandwidth  of  over  1  GHz,  zero 
voltage  gain,  and  a  50-Q  output  impedance.  The  gate  width  of  each  transistor  is  shown  in  the  figure.  The 
gates  are  gradually  widened  so  that  capacitive  charging  time  does  not  limit  the  bandwidth.  The  voltage 
supply  lines  are  divided  into  four  sections  so  that  the  amplified  output  signal  is  not  fed  back  to  the 
sensitive  input  part  through  the  lines. 

Figure  2  is  a  photograph  of  the  receiver.  A  GaAs  PD  is  attached  to  the  CMOS  circuit  by  means  of 
the  polyimide  wafer-bonding,  which  is  followed  by  mesa-etching  of  the  PD  and  interconnection  process. 
As  can  be  seen,  the  area  is  small  because  the  CMOS  circuit  is  simple  and  neither  bonding  wires  nor  large 
pads  are  needed  to  connect  the  CMOS  to  the  PD.  The  PD,  without  an  AR-coating,  showed  a 
photoresponsivity  of  0.42  A/W  and  a  dark  current  below  10  pA  @  5  V.  Figure  3  is  a  measured  eye 
diagram  for  the  GaAs-PD/CMOS  receiver  operating  at  1  Gb/s  when  optical  input  has  -26.4-dBm  average 
power  and  0.85-gm  wavelength.  The  input  was  215-1  PRBS  signal,  and  the  electric  output  off  chip  was 
directly  monitored.  For  the  measurements,  all  four  voltage  supply  lines  were  connected  to  a  single  3.3-V 
source.  A  good  eye  opening  is  observed.  The  O/E  conversion  efficiency  is  about  10000  V/W, 
corresponding  to  a  transimpedance  gain  of  about  25k£2.  Figure  4  plots  bit  error  rates  (BERs).  At  10 10 
BER,  minimum  input  sensitivities  are  -29.7  and  -27.1  dBm  for  bit  rates  of  622  Mb/s  and  1  Gb/s, 
respectively.  Dynamic  ranges  are  21.7  and  19.0  dB,  respectively.  The  receiver  operated  error  free  at  up 
to  1 .3  Gb/s  with  total  power  consumption  below  30  mW.  The  measurement  results  are  summarized  in 
Table  1. 

We  hybridized  InGaAs  PDs  in  almost  the  same  manner  as  the  GaAs  PDs.  The  measurement  results 
using  1.55-pm  light  are  also  summarized  in  Table  1.  The  sensitivities  are  a  little  worse  than  those  for  the 
0.8 5 -pm  receiver.  This  is  attributed  to  high  contact  resistance  and  a  fairly  large  dark  current  of  100  nA. 
0-7803-6252-7/00/$  1 0.0002000  IEEE  17 


In  conclusion,  a  GaAs  or  InGaAs  photodiode  was  attached  to  a  0.5-|lm  CMOS  receiver  circuit 
by  a  polyimide  wafer-bonding  technique.  The  circuit  is  simple  and  small,  and  has  high  sensitivity 
and  a  broad  bandwidth  due  to  the  low  parasitic  capacitance  on  the  photodiode.  The  receiver  operates 
with  a  single  3.3-V  supply  voltage  at  either  0.85  or  1 .55  pm  and  achieves  -27.1 -dBm  sensitivity  at  1 
Gb/s. 
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The  two  dimensional  arrays  of  GaAs/AlGaAs  multiple  quantum  well  (MQW)  modulators  and  detectors 
array  flip-chip  bonding1  directly  over  state-of-the-art  silicon  VLSI  can  allow  high  speed  electronic  processing  and 
high  density  optical  input  and  output2.  A  modulator-based  optoelectronic-VLSI  (OE-VLSI)  circuit  technology  has 
been  developed  at  Bell  Laboratories3'6. 

We  demonstrated  the  hybrid  integration  of  reflection-mode,  surface-normal  GaAs/AlGaAs  multiple 
quantum  well  modulators  and  detectors  array  directly  over  1pm  silicon  CMOS  circuitry  via  flip-chip  solder 
bonding.  Some  MQW  devices  were  used  as  input-light  detectors  and  others  served  as  output-light  modulators, 
which  were  designed  for  850nm  wavelength  operation.  The  GaAs/AlGaAs  multiple  quantum  well  (MQW) 
structure  was  grown  by  molecular  beam  epitaxy.  In  order  to  increase  the  absorption  of  the  intrinsic  region,  the 
period  of  quantum  wells  was  chosen  to  be  90  pairs  and  decreased  the  thickness  of  AlGaAs  barriers  for  decreasing 
for  operation  voltage. 

We  have  fabricated  1  X  20  the  modulators  and  detectors  array  for  optoelectronic  hybrid  integrated  module. 
The  MQW  device  mesa  was  60pm  X  160pm  in  size  and  each  was  consisted  of  a  40pm  X  40pm  P-pad  and  N-pad. 
The  active  region  of  the  device  was  the  P-pad  which  also  serve  as  a  reflector.  The  bump-bond  pad  sizes  and  optical 
windows  sizes  were  26pm  X  26pm.  Indium  was  deposited  on  both  GaAs  chip  and  silicon  chip  for  the  solder.  The 
circuits  contain  transimpedance  receiver,  switching  nodes  and  modulator  driver,  as  shown  in  Fig.  1.  The  receiver 
had  four  amplication  stages,  with  the  first  being  of  the  transimpedance  amplifier  and  inverter-based  amplifiers  in 
the  following  three  stages.  The  control  signals  of  A-C  used  as  selecting  output  light  modulators  addressed.  The 
transimpedance  receiver-transmitter  circuit  can  be  operated  at  300MHz  frequency. 

The  GaAs  substrate  was  removed  after  the  bonding  to  avoid  excess  absorption  in  the  substrate.  The  etching 
of  devices  was  stopped  on  the  AlAs  stop-etch  layer,  which  had  been  placed  between  the  GaAs  substrate  and  the 
active  devices  layer.  A  key  feature  of  hybrid  integrated  devices  for  flip-chip  bonding  and  then  GaAs  substrate 
removal  was  the  etching  of  outer  mesas  around  the  MQW  devices.  If  the  etching  were  not  controlled  well,  it  would 
cause  to  break  and  possibly  damaging.  In  order  to  avoid  this  case,  we  developed  the  method  which  when  the  one 
terminal  of  MQW  device  mesas  were  exposed  first  during  the  etching  process  of  GaAs  substrate,  the  etching  was 
stopped  immediately,  as  show  in  Fig.  2.  This  method  would  allow  large  area  of  the  wafer  remain  instead  of  small 
isolated  mesas  left  and  the  exposed  terminals  of  MQW  devices  were  used  as  sign  to  mark  their  location.  It  did  not 
bring  more  absorption  in  remain  GaAs  substrate  since  the  wafer  thickness  was  only  few  micron. 

In  fact,  we  get  nearly  the  same  reflection  spectrum  for  the  four  devices  at  an  array  after  flip-chip  bonding 
and  package,  shown  in  Fig.  3.  It  is  indicated  the  good  optoelectronic  uniformity  of  elements  modulators  and 
detectors  array.  Because  of  the  strong  exciton  absorption,  the  reflectivity  at  wavelength  850nm  is  the  minimum 
under  zero  field,  it  can  be  used  as  a  photodetector  in  input  windows.  In  other  hand  as  light  modulator  in  output 
window,  increasing  with  an  applied  field,  the  MQW  reflection  modulator  contains  the  effects  of  quantum  confined 
Stark  effect,  which  shows  the  shift  of  exciton  peak  and  the  change  of  the  absorption  coefficient  in  quantum  well 
under  applied  electric  field.  Figure  4  shows  the  microspot  reflectivity  spectrum  at  zero  and  5V  biases  of  the  MQW 
device  after  flip-chip  bonding.  When  the  operation  wavelength  is  at  860nm,  the  low  reflectivity  at  an  applied 
voltage  of  5  V  and  the  high  reflectivity  at  0V,  and  the  contrast  ratio  is  more  than  1 .2: 1  for  a  5V  bias. 

The  work  is  support  by  the  Chinese  High  Technology  863  Program  and  National  Natural  Science 
Foundation  of  China  (NSFC)  under  grant  69896260,  69789802,  69776036  . 
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Introduction 

Liquid  Crystal  on  Silicon  (LCoS)  is,  by  now,  a 
relatively  mature  technology  for  the  realization 
of  silicon  backplane  based  microdisplays  and 
spatial  light  modulators.  Most  silicon  backplane 
microdisplays  utilise  forms  of  nematic  material 
and  electro-optic  effects  which,  while  relatively 
slow  switching,  are  capable  of  supporting  grey 
levels  and  thereby  field  sequential  color  at  video 
frame  rates.  The  faster  switching  ferroelectric 
materials  and  electro-optic  effects  generally 
operate  in  a  binary  mode,  so  being  capable  of 
supporting  bit-plane  sequential  color  at  video 
frame  rates.  We  have  explored  the  two 
combinations  of 

•  tristate  antiferroelectric  liquid  crystal  on 
silicon,  and 

•  thresholdless  antiferroelectric  (or  V- 
shaped)  liquid  crystal  on  silicon. 


Pixel  count 

1280x1024  (SXGA) 

Pixel  pitch 

lOum 

Aperture  ratio 

>  80% 

Frame  rate 

>  60Hz  full  colour 

Contrast  ratio 

>30:1 

Table  1 .  Outline  taraet  SDecification  for  MINDIS 

The  primary  aim  of  our  collaboration,  entitled 
MINDIS  (Miniature  Information  Display  System) 
has  been  to  characterise  the  suitability  of  such 
materials  for  use  on  reflective  silicon 
backplanes  and  to  design  and  evaluate  an 
AFLCoS  microdisplay  whose  outline  target 
specification  is  shown  in  Table  1 . 


Figure  1 .  Example  of  results  of  AFLC 
material  characterisation 


Materials 

Electro-optical  characterization  has  been  carried 
out  on  a  number  of  candidate  materials  and  a 
comparative  evaluation  made.  DC  and 
switching  response  have  been  measured  in 
reflection  as  well  as  the  effect  of,  for  example, 
temperature  and  ageing.  Figure  1  shows  an 
example  of  such  characterization.  It  should  be 
noted  that  only  amplitude  (not  phase) 
modulation  has  so  far  been  explored. 
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Silicon  backplane 

A  silicon  backplane  has  been  designed, 
fabricated  and  functionally  tested  successfully. 
It  consists  of  an  array  of  1280x1024  pixels  on  a 
lOum  pitch  as  indicated  on  Figure  2.  The  pixel 
circuit,  Figure  3,  is  a  2-transistor  design 
consisting  of  a  PMOS  analogue  switch  and  a 
PMOS  FET  which  stores  charge  on  its  gate. 


column  data  bus 


In  order  to  minimise  hardware  design  risk  and 
retain  maximum  AFLC  drive  waveform  flexibility, 
minimal  peripheral  circuitry  has  been  included 
on  the  chip.  Silicon  wafers  are  currently 
undergoing  in-house . 

Drive  scheme 

A  major  undertaking  within  the  project  has  been 
the  design  of  appropriate  driving  waveforms  for 


TSAFLC  and  V-shaped  materials.  Such  drive 
schemes  have  to  take  into  account  both 
materia!  and  backplane  issues.  Several 
candidate  waveforms  have  been  developed  and 
successfully  tested  on  single  pixel  test 
structures  using  discrete  drive  circuits. 

In  order  to  maintain  the  flexibility  to  further 
optimise  these  waveforms  under  the  constraints 
imposed  by  our  SXGA  backplane,  a  field 
programmable  gate  array  (FPGA)  based 
solution  has  been  adopted  for  the  prototype 
interface.  The  pixel  driving  waveform 
originates,  in  effect,  from  a  fully  programmable 
waveform  generator. 

Conclusions 

TSAFLC  and  V-shaped  materials  have  been 
shown  to  be  physically,  optically  and  electrically 
compatible  with  a  CMOS  active  matrix 
backplane.  They  offer  promise  for  silicon 
backplane  based  microdisplays  and  spatial  light 
modulators.  It  should  be  noted  that  only 
amplitude  (not  phase)  modulation  has  so  far 
been  explored. 
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The  pervasive  implementation  of  optical  links,  including  fiber  to  the  home  and  in-home  fiber  optic 
backbones,  relies  upon  the  cost  reduction  of  optoelectronic  interfaces.  Highly  alignment  tolerant 
photoreceivers  can  reduce  the  packaging  cost  of  optical  links  for  applications,  which  include  those 
mentioned  above,  by  easing  packaging  constraints.  Previously  reported  optical  alignment  tolerance 
measurements  have  been  reported  for  speeds  up  to  3  kbps.  To  achieve  a  high  level  of  optical  alignment 
tolerance  at  higher  speeds,  a  high  responsivity,  large  area,  low  capacitance  photodetector  such  as  the 
inverted  metal-semiconductor-metal  (I-MSM)  detector  can  produce  excellent  results.  Herein,  we  report  on 
the  hybrid  integration  of  a  thin  film  large  area  (250  x  250  |im2),  low  capacitance  (0.43  pF),  high 
responsivity  (0.5  AAV,  with  no  AR  coating)  InGaAs/InP  I-MSM  onto  a  Si  CMOS  differential  receiver 
circuit.  This  integrated  receiver  (shown  in  Figure  1)  demonstrated  a  bit-error-rate  (BER)  of  10  41  at  414 
Mbps  and  O.lxlO'10  at  480  Mbps.  The  alignment  tolerance  of  this  receiver  has  been  modeled  and  measured 
at  200  Mbps,  and  the  theoretical  results  correspond  quite  well  to  experimental  data. 

Photodetector  speed  in  an  integrated  receiver  is  limited  by  either  the  RC  time  constant  of  the 
device  or  the  transit  times  of  photogenerated  carriers  within  the  device.  In  order  to  integrate  highly 
alignment  tolerant  receivers,  large  area  photodetectors  are  desired;  however,  the  input  capacitance  to  the 
receiver  must  be  minimized.  PIN  photodiodes  are  a  common  choice  for  integration  onto  receivers  due  to 
their  high  responsivities  [1,2].  However,  these  devices  have  a  large  capacitance  per  unit  area  that 
ultimately  limits  detector  size  and  alignment  tolerance  for  a  desired  data  rate.  Conventional  (C-)  MSMs 
have  a  low  capacitance  per  unit  area,  but  these  detectors  have  responsivities  much  lower  than  those  of  PINs 
due  to  shadowing  from  the  interdigitated  electrodes  on  the  top  of  the  device.  By  inverting  the  MSM  - 
essentially  flipping  a  C-MSM  upside  down  so  the  electrodes  are  on  the  bottom  of  the  device  and  removing 
the  substrate  -  this  tradeoff  between  large  area  and  high  responsivity  is  eliminated  with  only  a  minimal 
sacrifice  of  device  transient  response  [3], 

The  I-MSM  integrated  onto  the  receiver  shown  in  Figure  1  was  fabricated  from  a  wafer  grown  by 
molecular  beam  epitaxy  (MBE)  with  the  following  nominally  undoped  layer  structure:  400  A  InAlAs  cap 
layer,  500  A  graded  layer,  7400  A  InGaAs  layer,  500  A  graded  layer,  400  A  InAlAs,  2000  A  stop  etch 
layer,  and  InP  substrate.  Ti/Au  (250  A/2250  A)  electrodes  and  contact  pads  were  deposited  via  a  liftoff 
process.  The  electrodes  were  2  |tm  wide  and  had  5  pm  spacings.  The  contact 
pads  were  250  pm  wide  and  40  pm  long.  Mesas  were  defined  by  standard 
photolithography,  and  the  devices  were  mesa  etched  down  to  the  InGaAs  stop 
etch  layer.  The  MSMs  were  then  embedded  in  a  protective  Apiezon  W  wax 
coating,  and  the  substrate  and  stop  etch  layer  were  removed.  The  encapsulated 
devices  were  then  bonded  to  a  transparent  Mylar®  transfer  diaphragm,  and  the 
wax  was  removed.  At  this  point,  the  metal  electrodes  and  contact  pads  of  the 
MSMs  were  exposed  as  the  bonding  surface,  and  the  bottoms  of  the  devices 
were  bonded  to  the  diaphragm.  Such  an  arrangement  facilitated  the  MSM 
inversion  process  during  integration. 

The  differential  Si  CMOS  receiver  circuit  was  fabricated  at  the 
MOSIS  foundry.  It  consisted  of  a  fully  differential  current  mode  input,  current 
to  voltage  conversion,  and  voltage  gain  stages.  Ti/Pt/Au  (250  A/250A/2500  A) 
integration  pads  were  deposited  onto  the  circuit.  The  Ti  acted  as  an  adhesion 
layer,  and  Pt  was  used  as  a  diffusion  barrier.  The  exposed  device  contact  pads  were  used  to  metal-metal 
bond  an  individual  detector  to  the  integration  pads  on  the  receiver  circuit,  resulting  in  fingers  on  the  bottom 
of  the  MSM  device.  A  polyimide  coating  was  spun  onto  the  integrated  circuit  and  cured  at  200  °C  in  order 
to  promote  the  adhesion  of  the  detector  to  the  circuit.  The  polyimide  was  subsequently  removed  using  a 
dry  etch.  The  integrated  circuit  was  then  wire  bonded  to  a  printed  circuit  board  for  testing. 

The  performance  of  the  integrated  receiver  was  tested  using  a  Fabry-Perot  laser  pigtailed  to  a 
single-mode  optical  fiber.  A  BER  transmitter  was  used  to  drive  the  laser  with  a  pseudorandom  pattern. 
The  pigtailed  fiber  was  coupled  via  FC  connectors  to  a  single-mode  fiber  patch  cord  that  was  cleaved  on 
the  opposite  end.  The  cleaved  end  of  the  fiber  was  positioned  over  the  integrated  receiver  using  an  XYZ 


Figure  1  -  InGaAs/InP 
I-MSM  integrated  onto 
Si  CMOS  differential 
receiver  circuit. 
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translation  stage.  The  mode  field  diameter  of  this  fiber  was  given  by  the  manufacturer  as  9.3  pm.  A 
Keithley  Source-Measure  Unit  (SMU)  biased  the  I-MSM.  Photocurrent  as  a  function  of  fiber  to  detector 
misalignment  was  measured  at  200  Mbps. 

The  integrated  receiver  was  modeled  and  tested  for  two  types  of  alignment  tolerance. 
Longitudinal  alignment  tolerance  refers  to  receiver  performance  as  a  function  of  the  separation  between  the 
surface  of  the  photodetector  and  that  of  the  emitting  source.  Transverse  alignment  tolerance  describes  the 
receiver  performance  as  a  function  of  the  off-axis  separation  between  the  center  of  the  source  and  the  center 
of  the  detector. 

A  Gaussian  intensity  distribution  was  assumed  as  a  first-order  approximation  to  the  beam  emitted 
from  the  cleaved  fiber.  In  order  to  estimate  the  power  coupled  from  the  fiber  to  the  detector,  the  intensity 
of  the  Gaussian  beam  was  integrated  over  the  area  of  the  I-MSM  to  calculate  the  percentage  of  power 
coupled  from  the  fiber  to  the  detector  as  a  function  of  longitudinal  separation.  To  predict  the  coupling 
efficiency  under  lateral  misalignments  (with  1  mm  longitudinal  separation),  the  limits  of  integration  were 
appropriately  adjusted  to  effect  a  transverse  separation  between  the  centers  of  the  fiber  and  detector. 
Because  the  device  is  inverted,  carriers  can  be  excited  in  the  areas  of  the  InGaAs  absorbing  region  that  lie 
directly  above  the  metal  contact  pads.  This  has  the  effect  of  extending  one  dimension  of  the  250x250  pm2 
detecting  area  by  2  diffusion  lengths.  Electron  diffusion  length  within  the  InGaAs  was  calculated  to  be  -20 
pm  based  on  the  assumption  of  an  electron  mobility  of -10000  cm  2 /Vs  and  an  electron  lifetime  of  16.7  ns 
(as  described  in  [4]). 

Measured  photocurrent  was  used  to  calculate  power  coupled  from  the  fiber  to  the  detector.  These 
experimental  results  as  well  as  the  theoretical  predictions  for  coupled  power  as  a  function  of  longitudinal 
and  transverse  separation  are  compared  in  Figures  2  and  3.  The  experimental  data  in  Figure  3  was  taken 
with  1  mm  of  separation  between  the  fiber  and  the  detector.  Experimental  and  theoretical  data  in  both 
figures  have  been  normalized  to  optimal  fiber-detector  alignment.  The  discrepancy  between  the  theoretical 
curve  and  the  experimental  data  near  the  1mm  separation  mark  in  Figure  2  may  be  a  result  of  simplifying 
the  emitted  beam  to  a  Gaussian  or  of  the  transient  response  of  the  I-MSM  to  excitation  by  a  200  Mbps 
pseudorandom  signal.  The  discrepancy  in  Figure  2  accounts  for  the  discrepancies  in  the  peaks  shown  in 
Figure  3  since  the  latter  figure  was  evaluated  at  a  longitudinal  separation  of  1  mm. 
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Figure  2  -  Longitudinal  alignment  tolerance 
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Figure  3  -  Transverse  alignment  tolerance 


Thin  film  large  area  I-MSMs  have  been  integrated  onto  Si  CMOS  circuits.  These  integrated 
receivers  have  demonstrated  highly  alignment  tolerant  operation  at  200  Mbps.  Measured  alignment 
tolerance  of  the  receiver  corresponded  quite  well  to  the  performance  predicted  by  theory. 
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"It  is  difficult  to  say  what  is  impossible,  for  the  dream  of  yesterday  is  the  hope  of  today  and  the  reality 
of  tomorrow. "  Robert  H  Goddard. 


Mobile  communications  technology  has  made  significant  progress  from  the  1970/80's  first  generation 
providing  analog  voice-only  services,  to  the  1980/90's  second  generation  enabling  digital  voice  and 
data  services.  The  next  generation  will  be  heralded  by  truly  personal  communication  systems,  which 
are  expected  to  incorporate  a  high  degree  of  multimedia  functionality  (voice  and  data,  images,  and 
video),  mobility  (portable  low  power  devices,  heterogeneous  networks,  roaming),  and  flexibility 
(scalability,  multiple  access,  intelligence).  Thus,  the  new  generation  of  personal  interactive  mobile 
multimedia  mobile  communication  (M3C)  demand  highly  compact  low  power  devices  capable  of 
image  capture  and  display  as  well  as  low  bit  rate  coding  and  transmission.  The  new  design  concept 
based  upon  integration  of  optics  with  that  of  electronics  has  facilitated  the  realisation  of  a  novel 
Intelligent  Pixel  Array  (IP A)  architecture  to  create  a  real-time  implementation  of  forward  and  inverse 
wavelet  transform  (WT)  for  image  capture,  in-situ  processing  and  display. 

The  processor  consists  of  an  array  of  Intelligent  Pixel  (IP)  processing  elements,  as  illustrated 
schematically  in  Figure  1.  Each  element  of  the  array  incorporates  three  components:  (a)  a  photo¬ 
detector  (PD)  with  associated  analogue  to  digital  converter  (ADC)  for  the  conversion  of  the  incident 
light,  (b)  embedded  processor  for  implementation  of  parallel  video  codec  algorithm,  (c)  a  liquid 
crystal  display  for  the  conversion  of  the  decoded  incoming  video  stream  to  an  optical  output  for 
display.  To  carry  out  the  required  video  coding  the  IP  array  is  used  to  emulate  a  mesh  network  of 
processing  elements  such  that  the  inherent  parallelism  and  interconnectivity  of  the  architecture  is  fully 
exploited. 


Figure  1.  Intelligent  Pixel  processor 

A  wavelet-  transformation  is  essentially  a  sub-band  decomposition  that  segments  an  image  into  a 
number  of  frequency  domains  at  different  scales.  A  complete  two-dimensional  decomposition  can  be 
calculated  by  sequential  convolution  of  the  rows  and  columns  of  an  image  with  a  pair  of  suitably 
chosen  quadrature  mirror  filters  (QMFs)  followed  by  half  rate  sub-sampling  at  each  scale.  For  a  VLSI 
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implementation  1-D  FIR  filters  can  be  used  for  these  convolutions.  Suitably  chosen  wavelet  basis 
functions  are  used  to  derive  the  filter  coefficients.  The  transform  coefficients,  y„  are  thus  obtained  by 
convolving  the  pixel  input  values,  x„  with  the  coefficients,  ws,  of  an  FIR  filter  of  length  L: 

y,=±*^,  (■) 

5=1 


This  convolution  can  be  computed  by  scaling  the  image  coefficients  with  each  of  the  filter  coefficients 
and  accumulating  the  sum  of  these  products  in  each  pixel.  This  operation  is  performed  very  efficiently 
on  the  IP  array  as  the  mesh  network  of  processing  elements  allows  all  of  the  convolutions  in  one  plane 
to  be  carried  out  in  parallel.  In  addition,  adoption  of  binary  representable  filters  allows  the 
convolution  operations  necessary  for  the  computation  of  the  discrete  wavelet  transform  (DWT)  to  be 
carried  out  using  integer  arithmetic  only.  A  complete  multi-scale  DWT  can  thus  be  computed  on  the 
IP  array  using  a  single  bit-serial  adder  accumulator  along  with  two  additional  registers  to  store  the 
current  frame  for  left  and  right  shifting.  The  approach  facilitates  decomposition  at  any  scale  with  the 
same  time  complexity.  The  advantage  of  the  algorithm  is  that  at  the  end  of  the  transform  the 
coefficients  are  ordered  together  into  wavelet  blocks  facilitating  embedded  mapping  of  the  Zerotree 
Entropy  Coder(ZTE)  within  the  IP  array. 

Figure  2  outlines  the  logical  architecture  necessary  to  carry  out  the  DWT  using  a  symmetric  binary 
representable  wavelet  filter.  Due  to  the  limited  space  available  within  the  IPs  all  arithmetic  operations 
are  performed  in  a  bit-serial  fashion. 


Optical 


Figure  2.  Logical  architecture  within  each  pixel  for  computation  of  the  DWT 

The  massively  parallel  nature  of  the  IP  array  enables  the  architecture  to  operate  in  real-time  at  very 
high  frame-rates.  A  complete  three-scale  forward  wavelet  transformation  followed  by  ZTE  coding  can 
be  performed  in  less  than  300  clock  cycles  irrespective  of  image  size.  The  algorithm  is  completely 
symmetric  so  that  the  inverse  transform,  necessary  for  displaying  an  image  from  remote  site,  is  simply 
carried  out  in  reverse. 

The  IP  is  capable  of  real-time  very-low-bit-rate  video  coding  with  performance  comparable  to  other 
state-of-the-art  codecs.  Integration  of  optics  with  that  of  VLSI  technology  (Opto-VLSI)  has  provided 
important  foundation  for  realisation  of  new  generations  of  Systems-on-Silicon  (SoS). 
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Introduction 

In  this  paper,  a  detailed  theoretical  and  experimental  analysis  of  the  latency  of  optical  interconnects  in  a 
FPGA  system  is  presented.  The  optical  link  consists  of  a  CMOS  integrated  driver,  an  flip-chip  mounted 
RCLED  coupled  to  a  small-core  POF,  InGaAs  detectors  and  CMOS  integrated  receiver,  comparators  and 
amplifiers. 

The  optically  interconnected  FPGA  system 

FPGAs  are  electronic  components  that  can  be  programmed  to  implement  arbitrary  electronic  designs.  As 
programming  can  be  done  in-situ  and  repeatedly,  FPGAs  are  being  applied  in  rapidly  growing  numbers  in 
a  large  variety  of  applications.  But  FPGAs  are  frequently  plagued  by  a  lack  of  interconnect  capabilities. 
Multi-FPGA  systems  in  particular  suffer  from  the  lack  of  inter-chip  interconnect  capability.  Optoelectronic 
area-I/O  for  multi-FPGA  systems  would  offer  a  definite  advantage  in  terms  of  interconnect  capability. 
They  provide  a  viable  substitute  and  extension  of  the  electrical  interconnect  system  in  multi-FPGA 
systemsfl], 

A  small-scale  optoelectronic  FPGA  is  currently  being  realised  and  will  be  used  in  a  three-chip 
demonstrator  system  as  test  bed  for  these  concepts.  The  frill-custom  CMOS  FPGA  circuit  is  an  8  x  8  array 
of  simple  configurable  logic  blocks  (a  4-bit  function  table,  one  flip-flop),  interconnected  by  a 
programmable  6x6  switch  matrix  fabric.  The  switch  matrices,  traditionally  responsible  for  the  on-chip 
interconnections,  have  now  been  extended  to  include  the  off-chip  optical  interconnections.  The  optical 
components  consist  of  two  8x8  source  arrays  (either  RCLEDs  or  VCSELs)  and  two  8x8  InP  detector 
arrays,  which  are  flip-chip  bonded  to  the  CMOS  circuit  and  actually  overlay  part  of  the  CMOS  circuits. 
Electronic  driving  and  receiving  circuits  are  realised  as  part  of  the  CMOS  FPGA  chip,  and  are  intermixed 
with  the  digital  circuits  [2], 


Fig  1  :  picture  of  the  optically  interconnect  FPGA  demonstrator 

Each  of  the  256  optical  channels  is  designed  to  operate  at  an  information  rate  of  80  Mbit/s,  a  typical  data 
rate  for  high-end  commercial  FPGAs.  To  ensure  reliable  communication  over  so  many  parallel  channels  in 
a  noisy  digital  environment,  AC-coupled  communication  with  Manchester  coded  data  is  used  in  the  design. 
Hence,  the  information  rate  of  80  Mbits/s  requires  a  signalling  rate  of  160  Mbaud.  The  optical  pathways 
between  the  FPGA  chips  and  their  neighbouring  chips  are  two  removable  8x16  POF  ribbons.  The  CMOS 
FPGA  chips  containing  approx.  165,000  transistors  on  a  6.3  mm  x  8.3  mm  area  have  been  fabricated  in 
the  AMS  three  metal  0.6  um  technology. 

Results  and  discussion 

The  latency  of  the  complete  optical  link  is  simulated,  using  Cadence  and  a  large-signal  model  for  the 
RCLED.  The  detector  is  presented  by  a  capacitor.  The  latency  is  defined  as  the  time  difference  between  the 
input  signal  and  the  moment  a  signal  reaches  50%  of  its  static  value.  Fig  2  shows  the  calculated  latency  of 
the  link,  at  different  stages.  The  rise  latency  of  the  RCLED  is  larger  compared  to  its  fall  latency.  This  is  in 
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agreement  with  theoretical  calculations  :  the  delay  can  be  calculated  from  the  rate-equations,  and  is  given 
by: 


arctan/,^0.88^  and  /10(M50%  =  (V2  -l),f^r  =  0.42, 


The  fall  latency  of  the  receiver  is  larger  compared  to  its  rise  latency.  This  effect  is  explained  in  fig.  3.  The 
reference  level  of  the  comparator  (which  is  generated  using  a  low-frequency  filter)  is  smaller  than  the  mean 
value  of  the  peak-to-peak  signal.  Fig  4  shows  the  total  latency  of  the  link,  as  function  of  the  drive  current. 
The  rise  latency  decreases  as  function  of  the  drive  current,  due  to  the  shorter  LED  and  receiver  delay 
(because  of  the  larger  current  density  in  the  LED  and  the  larger  detector  current).  However,  the  fall  latency 
increases,  in  spite  of  the  smaller  RCLED  delay  at  larger  current  densities.  This  increase  of  the  latency  in 
the  receiver  -  as  explained  above  -  overcompensates  the  shorter  delay  in  the  LED. 


-  fall  latency 

-  rise  latency 


4  5  6 

LED  current  [mA] 


LED-  opt.  after  detector  receiver 

current  power  POP  current  volt. 


Fig  2  :  calculated  latency  of  the  optical  link. 


Fig  3  :  influence  of  reference  level  on  the  latency 


Fig  5  :  measured  period  of  the  ring  oscillator 


Finally,  measurements  have  been  carried  out  on  the  demonstrator.  The  FPGA  was  programmed  to  act  as  a 
ring  oscillator,  and  the  frequency  of  the  oscillator  was  measured.  This  frequency  is  inverse  proportional  to 
the  delay  of  the  ring.  It  was  found  that  the  maximal  frequency  is  about  22  MHz,  corresponding  to  a  45  ns 
period  (see  fig  5),  and  a  22ns  round  trip  (including  the  optical  and  the  electrical  path).  More  simulations  on 
the  complete  optical  link  (including  the  complete  electrical  and  optical  path)  were  in  agreement  with  the 
measurement.  As  a  conclusion,  it  was  found  that  the  overall  delay  the  optical  path  (including  drivers  and 
receivers)  is  about  5  to  6  ns.  The  pure  optical  delay  (the  time-of-flight  though  the  fibre)  is  only  a  small  part 
of  it.  The  main  part  of  this  delay  is  generated  in  the  LED  switching  and  in  the  receiver. 
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The  increasing  deployment  of  optic  networks  is  now  reaching  a  point  where  major  simplifications  and  cost  reduction 
can  occur  by  the  use  of  scalable  optically  transparent  switches.  In  the  following  paper  the  design,  assembly  and 
performance  of  a  prototype  1  x8  reconfigurable  holographic  tree-space  switch  is  described  [1].  Central  to  the  switch 
fabric  is  a  ferroelectric  liquid  crystal  (FLC)  on  silicon  spatial  light  modulator  (SLM)  [2]  deposited  with  a  540x  1 
array  of  planar  mirror  strips.  The  input  and  output  ports  to  the  switch  are  fabricated  as  a  linear  array  of  silica  planar 
waveguides  connected  to  single-mode  fibres.  The  waveguide  array  and  the  single  Fourier  transform  lens  for  the  2  / 
holographic  replay  system  are  housed  in  an  opto-mechanical  mount  to  provide  stability.  The  switch  operates  at  1 .55 
pm  wavelength  and  has  a  designed  optical  bandwidth  of  >60  nm. 
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Figure  lb :  plan 
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Figure  1:  Operation  of  the  ROSES  1x8  holographic  routing  switch. 

The  primary  aim  of  the  ROSES  project  was  to  develop  the  technology  required  to  implement  a  large  scale  NxN 
holographic  routing  switch.  As  a  first  step  towards  this  goal,  a  prototype  polarisation  independent  [3,4]  linear  1x8 
fibre  optic  routing  switch  was  developed.  The  switch  was  designed  to  take  a  signal  beam  from  a  single  mode  fibre, 
and  dynamically  route  it  to  one  of  eight  single-mode  output  fibres.  The  switch  is  shown  schematically  in  figure  1,  and 
consists  of  four  main  components:  a  waveguide  array  providing  spatial  fan-in  to  and  from  the  input  and  output  fibres, 
a  Fourier  lens,  a  reflective  binary-phase  ferroelectric  liquid  crystal  silicon  backplane  SLM  that  acts  as  a  beam¬ 
steering  element,  and  a  custom  interface  board. 

The  basic  operation  of  the  switch  is  as  follows:  the  optical  signal  is  launched  down  the  input  fibre,  and  couples  via 
the  ribbon  fibre  into  the  input  waveguide,  from  where  it  propagates  to  the  launch  end  of  the  waveguide  array,  and  is 
collimated- (figure  la)  by  the  Fourier  lens  onto  the  SLM.  The  SLM  consists  of  a  linear  array  of  540  pixellated 
electrodes,  each  1 8  pm  by  6mm,  with  a  2  pm  dead  space  between  the  pixels.  The  SLM  acts  like  a  polarization 


0-7803-6252 -7/00/$1 0.00©2000  IEEE 


29 


independent  phase-only  diffraction  grating  [3,4]  with  a  tuneable  period  and  pattern,  creating  a  deflection  angle,  ([>,  of 
the  reflected  beam,  as  shown  in  figure  lb:  this  figure  shows  the  path  of  the  centre  of  each  beam.  By  controlling  the 
deflection  angle,  the  beam  may  be  made  to  return  to  a  selected  point  on  the  waveguide  array,  from  where  the  signal 
propagates  across  the  waveguide  array  and  into  the  ribbon  fibre  and  output  fibre.  The  deflection  angle  depends  on 
the  phase  modulation  pattern  (the  hologram),  which  is  downloaded  from  a  PC  via  the  custom  interface  board  to 
SRAM  circuitry  integrated  with  the  pixels.  The  hologram  patterns  were  calculated  using  Fourier  series  analysis  under 
the  constraint  of  suppressing  crosstalk  and  backreflection  from  unwanted  diffraction  orders  to  target  levels  of -20  dB, 
and  -40  dB  respectively.  The  number  of  pixels  used  to  display  the  phase  modulation  was  chosen  to  be  sufficient  to 
avoid  excess  crosstalk  and  insertion  loss  penalties  due  to  clipping  of  the  tails  of  the  beam  incident  on  the  SLM.  The 
theoretical  diffraction  efficiency  of  the  holograms  varies  between  4  dB  and  4.8  dB:  4  dB  of  this  is  inherent  to  binary- 
phase  modulation,  while  the  0.8  dB  variation  is  the  loss  penalty  resulting  from  modifications  to  the  pattern  in  order 
to  suppress  crosstalk  from  unwanted  diffraction  orders.  Additionally,  in  order  to  maintain  DC  balancing  without 
severe  interruption  of  the  optical  signal,  the  hologram  pattern  was  scrolled  (one  pixel  at  a  time)  across  the  SLM.  The 
waveguide  array  was  designed  to  maximise  the  wavelength  range  of  the  system.  Custom  optomechanics  were  used  to 
align  and  control  the  relative  positions  of  all  optical  components. 

In  order  to  minimize  the  insertion  loss  through  the  switch,  the  optimum  liquid  crystal  tilt  angle  is  45 
degrees.  For  the  switch  demonstration  we  used  CDRR8,  an  organosiloxane  based  experimental  compound  with  the 
phase  sequence:-  I-(57)-Sc*-(room  temp)  and  a  tilt  angle  of  34  degrees  at  room  temperature.  The  diffraction 
efficiency  of  this  first  SLM  specifically  designed  for  phase  modulation  was  measured  to  be  8.0  ±0.2%,  (1 1.010.1  dB): 
this  value  includes  loss  penalties  due  to  non-ideal  tilt  angle  (0.7  dB),  dead  space  (0.9  dB)  and  cell  thickness  error, 
from  which  the  predicted  insertion  loss  for  the  switch  is  14.4  ±0.4dB  ,  including  connector  loss,  waveguide  loss  and 
lens  aberrations. 

The  switch  performance  was  tested  by  aligning  the  waveguide  array  to  maximize  the  signal  diffracted  into 
the  output  channels,  while  simultaneously  minimizing  the  crosstalk  between  channels.  The  SLM  was  then 
configured  to  direct  light  into  each  of  the  output  channels  in  turn,  and  power  measurements  made  to  determine  the 
full  8x8  crosstalk  matrix.  The  measured  insertion  loss  of  the  switch  was  16.9  ±0.5  dB.  The  excess  2.5  ±0.1  dB  loss  is 
due  to  non-optimum  alignment,  increased  aberrations  introduced  by  misalignment,  and  beam  distortion  due  to  a  non- 
uniform  curvature  of  the  SLM. 

The  signal-to-peak  noise  crosstalk  varied  between  a  worst  case  figure  of -19.1dB  to  optical  isolations  as 
high  as  -40.5dB.  These  results  are  extremely  promising,  as  the  target  crosstalk  figure  was  -20dB  and  the  theoretical 
worst  crosstalk  figure  was  calculated  to  be  -21.0dB,  assuming  perfect  alignment.  The  wavelength  response  of  the 
switch  was  measured  and  on  the  outermost  channel  a  worst-case  insertion  loss  variation  of  4dB  was  found  over  a 
60nm  range  centred  on  1550  pm.  Finally,  the  back  reflection  from  the  switch  was  investigated  and  a  worst  case 
value  of -3  5. 1  dB  was  measured. 

The  ROSES  project  has  brought  together  the  current  technology  in  SLMs,  free-space  opto-mechanics  and  in 
techniques  for  designing  waveguides  and  computer-generated  phase  holograms.  The  first  integrated  1  xN  switch 
using  reconfigurable  phase  holograms  has  been  built  in  a  form  suitable  for  system  trials,  and  to  a  specification  that 
would  enable  it  to  be  used  in  the  target  applications.  The  16.9  dB  insertion  loss  and  -19.1  dB  isolation  were  obtained 
without  the  need  to  precisely  control  each  switching  element.  Significant  improvements  in  these  figures  can  be 
expected.  In  the  former  case  by  optimising  the  FLC  switching  angle  and  thickness,  and  in  the  latter  by  reducing 
aberrations.  The  same  spatial  light  modulators  and  hardware  principles  are  capable  of  extension  to  significantly 
larger  switches. 
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SLMs  based  on  Ferroelectric  Liquid  Crystals  (FLCs)  offer  a  potentially  cheap  and  fast  technology  for  optical 
systems,  however,  they  are  limited  by  their  binary  modulation.  An  efficient  JTC  can  be  implemented  with  FLC  SLMs,  if  the 
1/for  two  pass  architecture  is  used[l].  More  importantly,  the  1/f  system  can  be  made  into  a  very  powerful  yet  optically 
simple  processor  when  it  is  implemented  with  a  liquid  crystal  over  silicon  (LCOS)  binary  phase  modulator[2]. 


Figure  1 .  Schematic  of  the  binary  phase-only  1/f  JTC  using  LCOS 

Recently,  this  architecture  has  developed  around  an  in-house  SLM  based  on  FLC  over  silicon  technology,  known 
as  the  fast  bitplane  SLM  (FBP-SLM)[3].  This  device  has  320x240  unplanarised  pixels  on  a  34pm  pitch  and  is  capable  of 
binary  phase  modulation  at  rates  in  excess  of  10  000  frames  per  second.  The  much  smaller  pixel  pitch  of  the  FBP-SLM 
compared  to  the  transmissive  devices  used  previously,  allows  a  much  more  compact  optical  system  to  be  built  as  in  Figure 
1.  The  fast  detector  array  in  Figure  1  is  currently  a  standard  CCD  camera,  with  two  possible  options  in  the  near  future.  The 
first  is  a  much  faster  CMOS  array  whilst  the  second  is  a  custom  designed  silicon  VLSI  smart  detector  array.  The  basic  unit 
has  been  tested  as  a  bench  based  compact  system  using  similar  mounts  as  the  previous  system.  The  overall  footprint  was 
200x200mm,  most  of  which  was  due  to  the  optical  mounts  for  the  components.  A  more  compact  has  also  been  made  using 
the  optical  rail/cage  system  and  results  from  this  will  be  shown  at  the  conference,  along  with  demonstrations  of  the 
applications  outlined  in  the  following  sections. 

A  very  important  observation  is  that  the  traditional  1/f  JTC  can  be  used  as  a  ‘recogniser’  or  comparator,  where  the 
reference  image  is  unknown  or  unspecified.  This  completely  reverses  the  role  of  the  correlator  and  opens  up  a  whole 
plethora  of  applications  that  revolve  around  object  tracking  and  motion  analysis.  Rather  than  having  a  pre-defined  target  or 
reference  image,  the  input  is  made  up  from  a  sequence  of  frames  from  a  video  source.  One  elegant  scenario  is  when  a 
correlator  is  used  to  compare  sequential  frames  in  a  video  stream  a  production  line.  In  such  an  application,  the  current  frame 
is  the  unknown  and  the  previous  frame  is  the  ‘reference’.  Events  that  occur  from  frame  to  frame  can  now  easily  be  tracked. 
This  is  outlined  in  the  simple  yet  very  powerful  example  of  Figure  2.  Here  we  are  implementing  an  industrial  inspection 
system.  The  current  frame  and  previous  frame  are  synchronised  with  the  progress  of  objects  through  the  system  (in  this 
example,  roadsigns).  If  the  sequence  does  not  change,  then  the  output  correlations  remain  from  frame  to  frame,  however, 
when  a  change  occurs  (in  this  example  a  rotated  roadsign),  then  the  correlation  between  frames  is  interrupted.  Moreover,  the 
cycle  of  distortion  can  be  detected  by  looking  at  the  sequence  of  disturbances  about  the  first  detected  defect. 
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Figure  2.  Example  application  for  an  optical  recogniser. 

Even  gradual  distortions  in  the  object  can  be  picked  up  by  correlating  over  multiples  of  frames  to  look  for  small 
changes.  Most  importantly,  the  whole  process  is  done  without  ever  knowing  anything  about  the  object  being  inspected.  This 
application  is  being  tested  experimentally  as  part  of  an  investigation  into  applications  and  markets  of  this  technology.  One 
area  that  is  being  tested  in  the  inspection  of  labels  on  a  production  line.  The  issue  here  is  to  detect  missing  or  distorted 
letters  in  the  names  printed  onto  labels  in  a  production  process. 

It  is  also  possible  to  use  the  optical  comparator  for  motion  estimation  and  tracking.  In  this  case,  the  current  and 
previous  frames  are  compared  to  see  if  there  has  been  any  motion  between  frames.  The  previous  frame  is  broken  into  a 
number  of  smaller  segments,  which  may  or  may  not  be  overlapping  as  shown  in  Figure  3.  Each  separate  segment  is  then 
correlated  with  the  whole  current  frame  at  a  rate  much  higher  then  the  video  stream  rate. 


Segments 


Current  Previous  Correlation#!  Correlation  #3 

Frame  Frame 


Figure  3.  Basic  concept  or  motion  estimation  with  the  recogniser. 

The  motion  of  each  segment  can  then  be  plotted  based  on  the  motion  of  the  correlation  peaks  within  a  particular 
sector  in  the  output  plane.  From  this,  an  idea  of  motion  between  frames  can  be  derived  and  this  motion  information  is  totally 
independent  of  the  objects  in  the  frame.  Such  a  process  is  vital  in  security,  crowd  density  and  traffic  monitoring  systems  as 
events  often  are  signalled  by  abnormal  motion.  A  more  adventurous  and  longer  term  application  of  this  concept  is  in 
generating  motion  vectors  for  MPEG-2  compression  systems.  With  a  640x512  pixel  comparator  running  at  101 2 3 4  frames  per 
second,  it  would  be  possible  to  obtain  motion  information  in  real-time 
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Figure  1 .  Transpar-TR  ring  network.  Optical  parallel  data  packets  transfer 
data  between  nodes  on  parallel  optical  channels. 
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The  Transpar-TR  is  Figure  1 .  Transpar-TR  ring  network.  Optical  parallel  data  packets  transfer 
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using  three-dimensional  optical  parallel  data  packets  (OPDPs).  Similar  to  most  packet  based  networking  schemes  on 
serial  interconnect  links,  the  OPDPs  contain  data  payload  and  source/destination  node  address  information.  The  data 
pipe  between  nodes  consists  of  a  2-D  array  of  optical  links  each  operating  at  on-chip  clock  rates.  This  data  pipe 
between  nodes  is  designed  to  accommodate  the  sum  of  all  other  smaller  data  pipes  entering  the  network  from 
individual  nodes,  thus  this  system  is  an  example  of  a 

‘firehose’ architecture  [4].  Our  goal  is  to  demonstrate  two  vcsel-msm  chip  one  byte 
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In  Transpar-TR,  each 
OPDP  consists  of  a  4  x  4 
spatial  array  of  eight  bit  bytes 
that  are  transmitted  from  the 
VCSELs  over  eight  clock 
cycles.  Twelve  of  these  sixteen 
channels  are  reserved  for  data, 
one  contains  destination  address 
information,  one  is  a  clock 
channel,  and  two  are  reserved 
for  testing  purposes. 

In  anticipation  of  crosstalk 
problems  between  VCSEL  and 
MSM  signals,  we  developed  a 
novel  slotted  token-ring 
protocol  for  an  N  node  (N  even) 

Transpar-TR  which  allows  collision  free,  high  throughput  network  operation  and  avoids  simultaneous  VCSEL- 
MSM  operations.  In  this  scheme,  each  node  waits  for  a  'ready  to  receive'  signal  from  the  next  node  before 
transmitting  an  optical  packet.  Packets  are  received  and  re-transmitted  at  each  node  as  they  traverse  the  network. 
Using  this  technique,  the  handshaking  protocol  allows  packets  from  alternate  network  nodes  to  occupy  the  network 
concurrently  (as  shown  in  Fig.  I).  Thus  N/2  nodes  transmit  simultaneously.  A  transmit  queue  and  a  receive  queue 
are  included  to  bridge  the  ultra  fast  optical  network  and  relatively  lower  speed  of  the  electrical  smart  pixel  array 
processors,  and  minimize  packet  loss.  Due  to  the  parallel  transmission  of  address  and  data  packets,  the  processing 
latency  is  minimized,  which  differs  from  generic  token-ring  networks. 

Transpar-TR  uses  a  Honeywell  VCSEL-MSM  smart  pixel  supplied  by  the  DARPA-sponsored  foundry  project 
operated  by  the  CO-OP  program  at  George-Mason  University  (co-op.gmu.edu/vcsel)  [6].  The  Altera  FPGA, 
VCSEL-MSM  chip,  and  a  driver  chip  designed  by  our  group  is  mounted  on  a  four  layer  printed  circuit  board  (PCB). 
The  board  is  connected  to  a  host  computer  using  a  high  speed  digital  interface  connector. 

3.  Conclusions 

We  view  this  system  as  a  first  step  toward  the  development  of  an  integrated  smart  pixel  having  active  sources.  The 
PCB  integration  of  Transpar-TR  components  has  several  disadvantages  compared  to  monolithic  or  flip-chip 
optoelectronic  integration  [7].  Perhaps  the  biggest  drawback  is  parasitic  inductance  and  capacitance  of  the  PCB  that 
make  detecting  low  current  MSM  signals  difficult.  However,  one  advantage  of  system  modularity  is  that  each 
component  can  be  individually  tested  and  characterized  before  system  integration. 
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Figure  3.  100  Mbps  operation  (left)  and  resulting  eye  diagram  (right).  The 
bottom  trace  in  the  voltage  driving  the  VCSEL  whose  optical  output  enters  an 
MSM  receiver.  The  middle  trace  is  the  receiver  chip  output. 
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Summary 

The  continuing  rapid  progress  in  micro  system  technologies,  micro  electronics,  and  packa¬ 
ging  techniques  strongly  impacts  developments  and  applications  in  many  technical  fields  like 
computer  and  information  technology  or  signal  transmission  systems.  Often,  signal  transmis¬ 
sion  throughput  falls  back  behind  local  processing  speed  on  chip  level  and  thus  builds  a  bott¬ 
leneck  limiting  overall  system  performance.  In  all  these  areas,  growing  demands  regarding 
signal  transmission  bandwidth,  insusceptibility  against  electromagnetic  interference  and  also 
cost  effectiveness  require  novel  solutions  for  interconnection  techniques. 

Optical  signal  paths  using  fibers  or  waveguides  represent  promising  solutions,  since  they  are 
inherently  insensitive  against  high  frequency  and  electrostatic  fields  and  at  the  same  time 
provide  very  large  bandwidths.  Optical  fibers  have  been  introduced  as  standard  transmission 
media  in  wide  area  networks  for  years  and  also  represent  the  state-of-the-art  for  high  perfor¬ 
mance  local  area  networks.  Even  in  networking  between  locally  distributed  electronic  modu¬ 
les  of  in-vehicle  systems  plastic  optical  fibers  are  utilized  as  high  performance  transmission 
media.  In  order  to  push  the  advantages  of  optical  signal  transmission  further  into  the  short 
haul  range  of  PCB  or  module  level,  cost  efficient  solutions  have  to  be  found  for  the  realizati¬ 
on  of  optical  interconnections. 

The  contribution  presents  concepts,  technologies,  and  first  results  for  novel  hybrid  electri¬ 
cal/optical  circuit  boards  which  extend  the  conventional  setup  of  existing  electrical  printed 
circuit  boards  by  incorporating  additional  layers  with  optical  waveguide  structures.  A  major 
aspect  of  the  inclusion  of  optically  functional  layers  into  conventional  circuit  boards  is  the 
requirement  of  maximum  compatibility  with  existing  board  fabrication  and  assembly 
techniques.  Any  substantial  change  in  board  fabrication  and  assembly  parameters  has  to  be 
avoided  in  order  to  obtain  the  desired  cost  efficiency. 

Thermal  and  mechanical  load  during  board  lamination  and  component  soldering  processes 
dictate  the  use  of  highly  thermally  stable  transparent  polymers.  The  desired  compatibility 
with  mechanical  tolerances  of  existing  board  fabrication  and  component  placement 
techniques  can  be  well  achieved  with  optical  multimode  waveguide  structures  having  core 
cross  sections  in  the  range  of,  e.g.,  (50pm)2  to  (100  pm)2.  Multimode  waveguide  structures 
have  been  fabricated  in  different  polymer  materials  using  hot  embossing.  The  formation  of 
waveguides  was  achieved  by  filling  the  embossed  channel  structures  with  a  core  material  of 
largef  refractive  index  compared  to  the  substrate  material.  Among  other  materials,  we  used 
commercially  available  optical  adhesives  for  this  step.  An  over-cladding  consisting  of  the 
same  material  as  the  substrate  finally  seals  the  waveguide  core  structures.  With  a  typical  in- 
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dex  difference  between  core  and  cladding  materials  of  approximately  0.03,  a  numerical 
aperture  of  0.3  results. 

The  optical  transmission  properties  of  multimode  waveguides  can  be  adequately  modeled  by 
ray  optical  methods  (ray  tracing)  rather  than  field  theoretical  approaches  (FEM,  FDM  or 
BPM)  due  to  the  very  large  number  of  propagating  modes.  A  realistic  description  of  light 
propagation  in  these  waveguides  has  to  include  mode  coupling  phenomena  induced  by  wave¬ 
guide  irregularities  like  a  finite  core  surface  roughness,  which  is  inevitably  introduced  by  any 
fabrication  process.  These  waveguide  deformations  lead  to  mode  coupling,  which  in  turn  has 
a  strong  effect  on  signal  propagation  of  the  entire  optical  interconnect.  The  resulting  transient 
behaviour  is  of  special  importance  for  short  haul  interconnects  like  board-integrated  wave¬ 
guides.  In  order  to  realistically  model  these  properties,  scattering  effects  have  been  included 
into  a  ray  tracing  algorithm  using  a  perturbation  description  of  angular  scattering  intensity 
distributions. 

Different  approaches  for  the  coupling  between  optical  emitters  and  receivers  and  waveguides 
are  compared.  A  first  concept  is  based  on  reflecting  facets  at  the  waveguide  ends  combined 
with  optoelectronic  components  mounted  with  their  axes  orthogonal  with  respect  to  the  wa¬ 
veguide  axes.  A  second  approach  uses  butt  coupling,  in  order  to  minimize  axial  offsets  re¬ 
sponsible  for  beam  widening.  This  is  especially  important  at  the  detector  side,  where  a  large 
waveguide  numerical  aperture  leads  to  a  large  beam  divergence,  thus  limiting  cross  talk  su- 
pression  between  neighbour  channels.  First  results  of  realized  hybrid  electrical/optical  boards 
with  mounted  optoelectronics  (VCSEL  emitter  and  PIN-detector  array  components)  are  re¬ 
ported. 

Examples  are  presented  showing  the  potential  of  board-integrated  optical  interconnects  for 
applications  like  in-vehicle  multimedia  optical  bus  systems. 
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Introduction 

This  paper  presents  a  LabView-based  interchip  electro-optical  parallel  interconnect  simulator.  Such  links 
are  a  possible  solution  for  the  interconnect  bottleneck  in  future  digital  systems.  The  links  and  their 
characteristics  are  quite  different  from  the  traditional  long  distance  optical  links  :  the  components  should  be 
small,  low-cost,  temperature-insensitive  and  simple;  and  the  latency  (the  overall  signal  delay)  and  power 
dissipation  must  be  kept  under  control.  This  simulator  focusses  inb  particular  on  the  modelling  of  the 
Resonant  Cavity  LED  (RCLED),  and  the  performance  analysis. 


Description  of  the  link 

The  parallel  interchip  optical  link  studied  in  this  paper  consists  of  CMOS  integrated  drivers,  an  8x8  array 
of  RCLED  flipchip  mounted  on  the  CMOS  chip,  coupled  to  a  POF  (Plastic  Optical  Fibre),  an  InGaAs 
detector  and  CMOS  integrated  receiver  and  amplifier.  There  is  no  optical  power  control  loop,  nor  time 
regeneration  circuits  included.  The  optical  link  is  currently  built,  in  the  framework  of  the  European  OIIC 
project.  The  simulator  is  implemented  in  Lab  View,  allowing  an  easy  integration  of  different  calculation 
techniques  and  an  easy  user  interface.  The  modelling  of  all  subcomponents  is  shortly  discussed  hereafter. 


The  electrical  model  of  the  driver  and  the  RCLED. 

The  driver  consists  of  CMOS  integrated  buffer  stages  and  a  driving  transistor.  The  driving  transistor  is 
modelled  by  a  current  source  and  a  parasitic  capacitance.  The  carrier  concentration  in  the  RCLED  active 
layer  as  a  function  of  time  is  calculated  by  solving  the  RCLED  rate  equations.  From  the  carrier 
concentration,  the  voltage  drop  across  the  RCLED  is  calculated.  This  allows  to  calculate  the  power 
dissipation. 


Figure  1  :  schematic  representation  of  the  link. 


current  [mA] 


Fig  2  :  calculated  PI  curves  of  RCLED 
coupled  to  POF,  as  function  of  the 
detuning 


The  optical  model  of  the  RCLED  and  coupling  to  POF  fibre 

An  RCLED  has  an  increased  extraction  efficiency  compared  to  standard  LEDs  :  the  intrinsic  spontaneous 
emission  pattern  is  enhanced  by  the  microcavity  effect.  In  this  simulation  tool,  the  temperature  and  current 
dependent  extraction  efficiency  is  calculated  from  the  cavity  parameters. 

NA2p0F  r 

Vextr,POF  =  rlextr,POF,nocav  '  enKav  ~  ~^2  J  eith(A)dA 

sub  int  r  _spectr 
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The  temperature  of  the  flipchip  mounted  RCLED  is  calculated  from  the  ambient  temperature  and  the 
power  dissipation  in  the  driver.  This  results  in  a  decreased  internal  quantum  efficiency  and  a  decreased 
enhancement.  The  coupling  efficiency  from  the  LED  to  the  POF  fibre  is  calculated,  taking  into  account  the 
lateral  misalignment.  Fig  2  shows  the  fibre  coupled  power  of  a  15  um  RCLED,  taking  into  account  the 
saturation  effect.  The  influence  of  the  cavity  tuning  is  shown  in  the  inset.  The  absorption  in  the  POF  is 
included,  but  no  dispersion  effects  are  taken  into  account  (negligible  because  of  the  short  fibre  length, 
typically  30  cm). 


The  detector  and  receiver 

The  detector  is  modelled  by  its  diameter,  responsivity  and  its  junction  capacitance.  The  receiver  is  a 
transimpedance  amplifier,  with  a  capacitive  load.  The  response  of  the  receiver  is  described  by  its  linear 
differential  equation,  which  is  solved  by  an  Euler  method.  From  the  time  domain  trace,  an  eye-diagram  is 
constructed.  An  input  current  noise  source  is  added  to  calculate  the  BER  and  SNR.  The  noise  is  assumed 
to  be  gaussian  distributed,  and  its  rms-value  is  given  by  the  standard  expression  [2], 


Fig  3  :  min  drive  current  of 
RCLED,  as  function  of  device 
diameter 


Fig  4  :  calculated  eye-diagram  of 
optical  link. 


Fig  5  :  calculated  BER  of 
optical  link. 


Discussion  of  the  simulation  results 

First,  an  optical  link  using  120um  core  diameterPOF  is  studied.  Fig  3  shows  the  minimal  drive  current  to 
achieve  a  15uA  detector  current,  and  the  corresponding  temperature  increase  in  the  RCLED  (including  the 
driver  heating)  as  function  of  the  RCLED  diameter.  The  optimal  diameter  is  in-between  45  and  80  um. 
Next,  optical  links  using  a  62  um  POF  are  studied.  Fig  4  shows  a  calculated  eye-diagram  of  a  link  with  a 
30um  diameter  RCLED  and  driven  by  3  mA  current  pulses  at  200  MBps.  The  corresponding  BER  as 
function  of  the  sampling  time  is  shown  in  fig  5.  There  is  clearly  a  wide  time  margin. 

In  conclusion,  a  simulation  tools  for  RCLED-POF-CMOS  based  interconnection  links  is  presented.  This 
tools  allows  us  to  study  the  thermal  and  speed  properties  of  the  complete  link.  Good  agreement  between 
experiments  and  simulations  was  found. 
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With  the  explosive  growth  in  fiber  optic  communication  networks  has  come  a  wide  range  of 
opportunities  for  electronic  enhanced  optical  systems  (1).  This  paper  describes  some  of  the 
opportunities  for  electronic  enhancement  of  fiber  optic  systems,  and  present  some  new  results  of 
device  evaluations  in  this  area.  Recent  standards  efforts  have  turned  their  attention  to  data 
transmission  rates  of  10  gigabit/second  and  above  for  a  single  data  link.  This  is  being  driven  by  a 
number  of  factors,  including  new  industry  standards  efforts  from  the  IEEE,  ANSI,  and  Infiniband 
Trade  Association  (2).  Serial  transceivers  for  10  gigabit/second  have  been  demonstrated,  but 
present  problems  dealing  with  signal  integrity  between  CMOS  logic  chips  and  the  optical  physical 
layer.  One  proposal  is  to  integrate  the  10  Gigabit  serialze/deserialize  function  and  data 
encode/decode  function  into  the  media  access  layer  as  a  single  CMOS  ASIC,  while  an  eye  opener 
function  and  media  dependent  input/output  interface  would  be  implemented  as  a  hybrid  of  silicon 
germanium  (SiGe)  and  gallium  arsenide  (GaAs)  technologies.  We  have  helped  develop  new  SiGe 
manufacturing  processes  such  as  ultra-high  vacuum  chemical  vapor  deposition  which  allow 
integration  of  SiGe  with  standard  CMOS  circuitry  to  form  BiCMOS  devices  using  sub-0.5  micron 
designs  and  standard  200  mm  wafer  substrates.  For  example  (3),  SiGe  cross  point  switches  have 
been  demonstrated  with  17  x  17  channels  capable  of  3.2  gigabit/second  data  rates  while  requiring 
only  3.3  V  and  dissipating  only  3.7  W  of  power.  SiGe  transimpedance  amplifiers  designed  for  10 
gigabit  links  have  exhibited  3  Ghz  bandwidth  with  280  nA  room  mean  square  noise,  sensitivity 
down  to  19  microamps,  and  power  consumption  as  low  as  470  mW. 

This  growing  demand  for  channel  bandwidth  has  also  driven  the  acceptance  of  dense  wavelength 
division  multiplexing  (DWDM).  As  data  rates  approach  OC-768,  multiple  SONET-based  overlay 
networks  are  evolving  into  a  service  transparent  DWDM  structure,  with  features  such  as 
protection  switching,  optical  add/drop,  and  quality  of  service  pushed  to  the  edge  of  the  network. 
There  is  a  clearly  emerging  trend  towards  elimination  of  the  ATM  and  SONET  layers,  and 
towards  the  direct  transmission  of  IP  over  DWDM.  This  new  model  requires  the  DWDM  layer  to 
assume  many  of  the  traditional  functions  associated  with  ATM  over  SONET.  Different 
implementations  have  been  proposed;  for  example,  the  Optical  Internetworking  model  allows  the 
IP  switching/routing  layer  to  operate  at  the  same  data  rates  as  the  DWDM  equipment  and  serve  as 
a  centralized  point  of  pure  IP  network  management.  A  complimentary  viewpoint  known  as  the 
“intelligent  optical  network”  maintains  that  IP  switches/routers  can’t  provide  the  wavelength 
switching,  service  provisioning  and  rapid  path  restoration  functions  required  by  larger  networks, 
and  these  functions  best  reside  in  the  optical  layer.  Such  devices  would  either  be  integrated  with 
or  interface  with  existing  DWDM  equipment,  and  would  provide  the  above  functions  in  addition 
to  handling  the  details  of  signal  transport.  Both  types  of  intelligent  networking  offer  the 
advantage  of  bandwidth  management,  protection  switch  events  within  the  maximum  SONET 
switch  time  of  50  ms  or  better,  and  scalabilty  to  terabit  network  bandwidth.  Furthermore,  there 
are  a  number  of  proposals  to  more  closely  integrate  the  electrical  and  optical  domains  by  linking 
IP  data  packets  directly  to  DWDM  optical  wavelengths,  so  that  the  optical  network  can  take 
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some  advantage  of  the  intelligence  imbedded  in  IP  traffic.  For  example,  the  Optical  Domain 
Service  Interconnect  (ODSI)  coalition  seeks  to  define  common  control  interfaces  between  the 
physical  and  IP  MAC  layers  based  on  derivatives  of  the  Internet  Engineering  Task  Force 
Multi-Protocol  Label  Switching  (MPLS).  ODSI  proposes  control  plane  standards  for  layer  1  and 
2  which  must  be  met  by  both  DWDM  equipment  and  broadband  IP  switches/routers.  A 
complimentary  effort  called  Multi-Protocol  Lambda  Switching  has  been  made  by  the  IETF  itself 
to  link  optical  cross-connects  and  gigabit  routers  through  a  Layer  3  switching  methodology  on 
top  of  the  ODSI  protocols. 

IBM  has  recently  introduced  the  2029  Fiber  Saver,  a  DWDM  solution  designed  to  accommodate 
future  intelligent  network  requirements  (4).  The  2029  offers  80  gigabytes  of  managed  bandwidth, 
and  is  compatible  with  long-haul  networks  using  10  gigabit  links  which  can  scale  the  aggregate 
network  bandwidth  to  over  6.5  terabytes.  The  DWDM  electronics  incorporates  data  retiming, 
reshaping,  and  regeneration,  protection  switching,  and  in-band  management  into  the  edge  of  a 
protocol  independent,  self-healing,  fault  tolerant,  dual  counter  rotating  ring  network.  These 
features  have  been  tested  by  constructing  a  fully  protected  DWDM  network  which  included  two 
IBM  G5  Enterprise  Servers  (capacity  of  1600  MIPS  each)  in  a  40  km  Parallel  Sysplex 
architecture  including  4  sysplex  timers  and  4  intersystem  channel  coupling  links;  four  FICON 
channels,  each  carrying  a  time  division  multiplex  of  4  ESCON  channels  through  a  9032-5  ESCON 
Director  driving  an  IBM  3945  automated  tape  library  with  10  terabytes  storage  at  a  distance  of  70 
km;  and  a  500  user  Gigabit  Ethernet  LAN  which  spanned  a  35  km  hubbed  ring  interconnected 
with  a  Cisco  Catalyst  5000  router.  This  network  operated  error  free  (extrapolated  to  10'15  bit 
error  rate)  over  a  4  week  test  period,  with  99.999  %  availability  of  end  user  applications  even 
when  single  points  of  failure  were  deliberately  introduced  into  the  testbed. 
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The  design  of  high  performance  computing  and  telecommunication  switching  equipment  requires  very  accu¬ 
rately  the  knowledge  of  signal  integrity  and  timing  behavior  of  all  intrasystem  and  intersystem  interconnects 
(i.e.  chip-to-chip,  board-to-board,  cabinet-to-cabinet).  In  order  to  predict  and  optimize  the  signal  transfer 
properties  the  use  of  computational  methods  is  inevitable  during  the  system  design  process.  For  modeling 
and  simulating  the  physical  behaviour  of  electrical  interconnects  very  accurate  models  and  comfortable  soft¬ 
ware  tools  are  available.  But  comparable  models  and  tools  supporting  the  design  of  optical  interconnects  do 
not  exist.  d#7^.t*,l‘'  passim  optical  path 

The  integration  of  optical  interconnects  into  electronic 
systems  results  in  hybrid  electrical-optical  intercon¬ 
nects  (Fig.  1).  To  design  such  systems  the  physical 
behaviour  of  the  electrical  and  optical  parts  has  to 
be  modeled.  The  latter  one  requires  the  considera¬ 
tion  of  the  technological  given  facts  typical  for  the 
addressed  interconnection  level  (component-,  module-, 
and  system  level)  in  order  to  select  appropriate  numer¬ 
ical  methods  and  simulation  models.  In  case  of  guided 
wave  optics  the  interconnection  level  characteristical 
dimensions  determine  whether  the  optical  singlemode 
or  multimode  technology  is  applicable. 


For  interconnects  on  board  level  and  between  boards 
and  cabinets  multimode  waveguides  are  the  appropri¬ 
ate  solution  due  to  their  relaxed  tolerance  requirements 
compared  to  singlemode  technology.  On  printed  circuit 
boards  only  multimode  waveguides  with  cross  sectional 
dimensions  comparable  to  those  of  microstrip  lines  lead 
to  an  acceptable  technology  (Fig.  2).  This,  of  course, 
has  a  significant  influence  on  the  modeling  and  simu¬ 
lation  methodology.  Singlemode  interconnects  can  be 
modeled  by  well  known  numerical  methods  (e.  g.  FEM, 
BPM),  but  these  methods  are  not  applicable  to  optical 
waveguides  with  more  than  1000  propagating  modes, 
where  additionally  manufacturing  caused  roughness  of 
the  waveguide  surfaces  has  to  be  considered.  For  this 


Figure  2:  Electrical  printed  circuit  board  with  optical  mul¬ 
timode  waveguides  [2] 


reason  only  methods  based  on  geometrical  optics  (ray  tracing)  can  be  applied. 


The  modeling  of  the  overall  optical  path  requires  models  of  the  optical  transmitters  (VCSEL)  and  receivers 
(photodiode)  suitable  for  the  ray  tracing  analysis  of  the  optical  waveguides.  To  represent  the  emission  of 
the  optical  source  the  emitted  beam  is  discretized  in  angular  domain  in  order  to  get  a  finite  number  of 
initial  rays.  At  the  end  of  the  passive  optical  path  a  reference  plane  is  defined  which  includes  the  sensi¬ 
tive  area  of  the  photodiode.  Equivalent  to  the  modeling  of  the  laser  the  photodiode  is  also  discretized  to 
model  a  location  and  angular  dependent  sensitivity.  In  case  of  ideal  waveguide  boundaries  every  initial  ray 
would  hit  exactly  one  sub-area  but  in  case  of  rough  surfaces  every  initial  ray  is  able  to  contribute  to  the 
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illumination  of  all  sub-areas.  To  model  the  transmis¬ 
sion  characteristics  of  the  overall  optical  path  a  imd- 
tiport  (Fig.  3)  with  N  inputs,  M  outputs,  mid  time 
dependent  transfer  functions  hnrn.(t)  can  be  defined. 
The  N  inputs  and  M  outputs  represent  the  steradian 
segments  An  of  the  laser  beam  and  the  sub-areas  Bm 
of  the  photodiode,  respectively.  The  transfer  fimetions 
hnm(t)  characterize  the  transfer  behaviour  of  the  opti¬ 
cal  waveguide  in  respect  of  the  optical  power  emitted 
into  the  nth  spatial  angular  element.  These  functions 
are  time  dependent  to  take  mode  dispersion  as  well 
sis  mode  coupling  caused  by  surface  roughnesses  and 
waveguide  inliomogenities  into  account.  Then-  numeri¬ 
cal  determination  can  be  performed  by  a 
waveguides  into  accoimt  [1], 


input  output 

Figure  3:  Multiport  representation  of  an  optical  multimode 
interconnect 


ray  tracing  algorithm  which  Is  able  to  take  surface  roughnesses  of 


The  functions  hnvL(t)  cannot,  be  computed  directly,  but  its  integral  $nm(t)  =  f  hnm(t)dt  being  the  step 
response  of  the  corresponding  optical  path.  Taking  into  account  the  physical  behaviour  of  wave  propagation 
and  the  ray  tracing  modeling  approach  some  important  properties  of  \Pnm(f)  can  be  derived. 


♦  ^nm(t)  is  zero  before  ti,  determined  by  the  finite  propagation  velocity  of  light. 

'^nm(t)  —  0,  t  <  tj  (1) 

♦  ^nm(f)  is  constant  and  positive  after  the  time  t2-  mainly  determined  by  the  mode  dispersion. 

d'ttu.Xf)  =  const.  >0,  t  >  t2  (2) 

♦  ^nm(t)  is  monotonical  rising  during  tj  <t<h. 

=  hnm{t)  >  0,  h<t<  t2  (3) 


To  use  these  functions,  wliich  are  only  given  by  a  fi¬ 
nite  set  of  discrete  points,  for  time  domain  simulations, 
an  appropriate  approximation  should  be  introduced  in 
order  to  avoid  discrete  convolution  algorithms  lead¬ 
ing  to  extremely  high  and  unacceptable  computation 
times.  Due  to  the  fundamental  properties  of  U >nm(f)  an 
approximation  by  a  sum  of  weighted  and  delayed  de¬ 
creasing  exponentud  functions  of  the  kind  «>(1  —  e~at) 
is  proposed.  This  approach  enables  the  application  of 
recursive  convolution  algorithms  leading  to  very  ac¬ 
curate  results  and  short  computation  times  [4].  The 
described  modeling  methodology  is  validated  by  ana¬ 
lyzing  a  slab  waveguide,  whose  transmission  character- 
istics  can  be  determined  very  accurately  applying  the 
Coupled  Power  Theory  [3], 


ioa 

io‘‘ 
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Figure  4:  Normalized  power  distribution  of  ali  41  guided 
monies  of  a  slab  waveguide  (height  100/im)  with  surface  rough¬ 
nesses  after-  a  propagation  length  of  10  cm  and  an  excitation  of 
the  20st  guided  mode  calculated  by  the  Coupled  Power-  Theory 
and  the  extended  ray  tracing  technique 


[1]  Th.  Bierholf,  A.  Wallrabensteiu,  A.  Himmler",  E.  Griese,  and  G.  Mrozynski,  G..  "All  approach  to  model  wave  propagation  in 
highly  multimodal  optical  waveguides  with  rough  surfaces”,  in  Proc.  X.  International  Symposium  on  Theoretical  Electrical 
Engineering  (ISTET’99),  pp.  515  -  520.  Magdeburg.  Germany,  Sept.  1999. 

[2]  E.  Griose,  "Parallel  optical  interconnects  for  high  performance  printed  circuit  boards” ,  in  Proc.  6th.  International  Conference 
on  Parallel  Interconnect pp.  173-183,  Anchorage,  Alaska  (USA),  Oct.  1999. 

[3]  D.  Marcuse,  Theory  of  Dielectric  Optical  Waveguide*,  Academic  Press,  New  York,  1972. 

[4]  M.-Ramme,  E.  Griose,  M.  Kurten,  "Fast  simulation  method  for  a  transient  analysis  of  lossy  coupled  transmission  lines  using  a 
semi-analytical  recursive  convolution  procedure” ,  in  Proceeding s  of  1997  IEEE  International  Svmporium  on  Electromagnetic 
Compatibility,  pp.  277  -  282.  Austin,  Texas  (USA),  1997. 


44 


TuA2.2 

10:45am-1 1:00am 


Optical  Waveguide  Taps  on  Silicon  CMOS  Circuits 

V.  E.  Stenger  and  F.  R.  Beyette  Jr. 

University  of  Cincinnati 
Department  of  ECECS 
PO  Box  210030 
Cincinnati  OH  45221-0030 
vstenger@ececs ,uc .edu  or  bevette@ececs.uc.edu 


Abstract 

A  CMOS  compatible  integrated  optic  waveguide  tap  technology  is  proposed  which  has  potential  for  high  electrical 
and  optical  bandwidth.  The  technology  encompasses  grating  couplers  and  mode-controlled  optical  detection  to 
realize  low-loss,  wavelength-selective  taps. 

Abstract 

As  processor  and  RAM  speeds  approach  the  GigaHertz  regime,  it  is  becoming  more  difficult  to  realize  efficient, 
high-density,  metal  interconnects  at  the  board  and  multi-chip  levels.  The  high  carrier  bandwidth  and  parallelism  of 
optical  interconnects  make  them  a  good  candidate  for  interconnection  of  electronic  gate  technologies.  The 
parallelism  of  optical  signals  has  been  largely  exploited  in  the  form  of  local  and  long  haul  fiber  communications 
networks.  Others  have  proposed  or  demonstrated  various  optical  distribution  schemes  at  the  board  and  chip  levels. 
[1-3]  To  properly  exploit  the  large  optical  carrier  bandwidths,  optoelectronic  interconnects  must  also  include  high¬ 
speed  optoelectronic  conversion  for  receivers,  and  high-speed  electro-optic  modulation  for  transmitters.  For  this 
work,  the  receiver  portion  of  the  optical  interconnect  problem  is  addressed.  In  particular,  a  practical  solution  based 
on  standard  CMOS  and  integrated  optic  polymer  waveguide  technology  is  proposed.  Optical  signals  are  distributed 
proportionately  over  a  series  of  taps  (Fig.  la),  or  are  split  up  by  wavelength  via  grating  couplers  (Fig.  lb). 


(b) 

Figure.  1:  Top  view  of  optical  signals  being  routed  to  multiple  points  on  a  chip  or  circuit  board,  (a)  Distribution  of 
one  signal  with  equal  power  detection  at  each  tap,  and  (b)  division  by  wavelength,  where  each  light  color  represents 
an  additional  wavelength  or  bit  signal. 

The  advantage  of  optics  is  already  apparent  in  Figure,  lb,  in  that  multiple  signals  can  appear  as  different 
wavelengths  on  the  same  optical  “trace”.  The  colors  are  selected  based  on  the  Bragg  grating  pitch  at  each  tap 
device.  In  both  Figure  1  cases,  the  optical  signals  are  converted  to  electrical  ones  via  high-speed  traveling-wave 
photodetectors.  These  photodetectors  are  based  on  standard  CMOS  structures,  and  do  not  introduce  any  new 
complications  to  the  processing.  The  optical  waveguides  are  made  from  polymer  materials  in  the  interest  of 
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economy  of  processing  and  of  planarization  requirements.  The  basic  structure  for  the  traveling  wave  photodetector 
tap  device  is  shown  in  Figure  2.  The  idea  behind  a  traveling  wave  structure  is  that  the  optical  signal  and  induced 
electrical  signals  are  matched  in  velocity  along  the  respective  optical  and  electrical  guides.  High  gain,  speed,  and 
power  handling  can  be  achieved  with  such  designs. 


Ridge 

Electrodes 


Depletion 

Region 


Optical 
Absorption 
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Figure  2:  Detailed  top  and  cross  sectional  views  of  the  tap  design.  Note  the  high-speed  traveling  wave  design  for 
the  photodetector  for  high  speed  optical  to  electrical  conversion. 

A  key  feature  of  the  tap  technology  is  that  a  series  of  taps  is  fashioned  in  such  a  way  that  losses  are  minimized.  This 
is  accomplished  by  transporting  the  light  in  “single-mode”  between  taps.  The  tap  section  itself  is  multimode  in  the 
vertical  (y)  direction,  such  that  light  propagates  in  a  well-defined  pattern  toward  the  detector/substrate  region 
without  scattering  loss.  For  a  certain  tap  length,  the  light  oscillates  back  to  a  profile,  which  can  couple  efficiently 
into  the  exit  guide.  This  is  shown  in  the  figure  3  below. 


Upper  Cladding 

Waveguide  Core 
Lower  Cladding 
CMOS  Devices 
Photodetector 


Figure  3:  Side  view  cross  section  of  the  optical  tap,  showing  multimode  oscillation 

To  date,  basic  structures  have  been  simulated  using  BeamProp  waveguide  optics  design  software  with  promising 
results.  Typical  structure  parameters  are  1  pm  HMDS  upper  and  lower  claddings,  1pm  Polycarbisol  core  layer  with 
a  0.1pm  ridge,  and  tap  lengths  in  the  20  pm  regime.  Grating  coupler  lengths  are  in  the  0.3  to  0.5  mm  regime  to 
ensure  good  wavelength  selectivity.  Fabrication  of  test  structures  is  currently  underway. 
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This  paper  shows  how  to  distribute  given  algorithms  onto  processing  systems  composed  of  several 
numbers  of  optoelectronic  LSI  (OE-LSI)  chips  and  optical  interconnections  so  that  the  potential 
capability  of  the  physical  layer  enhanced  by  the  optical  technologies  are  utilized. 


1  Introduction 

This  paper  is  theoretically  approaching  an  optimal 
algorithm  design  method  for  optically  interconnected 
parallel  processing  systems  in  a  systematic  way  to 
utilize  the  enhancement  of  the  physical  layer  capa¬ 
bilities  supported  by  optoelectronic  LSI  (OE-LSI)  or 
free-space  optical  interconnects.  The  method  could 
be  adapted  to  OCULAR-II  [1]  which  is  a  hierarchi¬ 
cal  parallel  optoelectronic  system  to  yields  its  best 
performance. 


Fig.  1  The  System  Model:  Each  LSI  has  numerical 
indexes. 

2  Model  of  Optically  Interconnected 
Systems 

A  time  frame  to  perform  a  computation  written  by 
y  =  f{x\ ,  X2)  is  composed  of  (1)  the  cost  to  transfer  a 
variable  to  a  chip,  which  we  call  “Elemental  Inter¬ 
connect”:  tc ,  and  (2)  the  cost  to  execute  a  compu¬ 
tation,  namely  the  “Elemental  Computation”:  tp. 
Based  on  the  roadmaps  of  semiconductor  and  opti¬ 
cal  interconnects  technologies  [2]  [3],  the  transmission 
delay  of  interconnection  is  being  reduced  and  will  be 
comparable  to  that  of  the  instruction  cycles  on  a  chip, 
which  is  evaluated  as  tc  ~  tp.  In  addition  to  the  tem¬ 
poral  properties,  the  capabilities  supplied  by  paral¬ 
lel  optical  devices  should  be  represented  as  features 
per  unit  space  and  unit  time  to  clearly  exploit  the 
spatially  dense  channels  and  the  possible  high-speed 
off-chip  clock  frequency. 

Here  we  define  two  indexes  that  represent  the  po¬ 
tential  capability  of  OE-LSI:  (1)  Communication 
Capability  Density  ( CDi )  :  Product  of  the  num¬ 
ber  of  I/O  channels  per  unit  area  of  LSI  i  and  the 
frequency  of  individual  channels.  Here,  i  is  the  index 
of  a  LSI  in  a  entire  system,  (2)  Processing  Capa¬ 
bility  Density  ( PDj )  :  Product  of  the  number  of 
transistors  per  unit  area  of  LSI  i  and  the  frequency 
of  individual  transistor. 

In  addition,  free-space  optics  provides  flexibilities 
for  the  data  transfer  pathways  such  as  fan-out,  fan-in, 


or  reconfigurable  interconnection  topologies.  To  rep¬ 
resent  interconnection  flexibility,  a  matrix  M  =  {rriij} 
called  an  Interconnection  Matrix  is  defined  to  repre¬ 
sent  the  connectivity  among  OE-LSIs  where  the  ex¬ 
istence  of  connections  between  LSI  i  and  j  is  indi¬ 
cated  by  rriij  =  1  and  no  connection  is  expressed  by 

rriij  =  0. 

Picking  up  all  the  Elemental  Interconnects  and  El¬ 
emental  Computations  contained  in  a  given  applica¬ 
tion,  if  some  portion  of  Elemental  Interconnects  or 
Elemental  Computations  can  be  performed  simulta¬ 
neously,  total  latency  will  be  reduced.  To  numerically 
discuss  the  optimal  algorithm,  let  the  amount  of  the 
Elemental  Interconnect  given  to  a  unit  area  of  an  OE- 
LSI  specified  by  index  i  during  a  unit  time  be  “Input 
Density”,  denoted  by  IDi,  and  let  being  the  num¬ 
ber  of  the  Elemental  Computation  allocated  to  LSI 
i. 

Using  above  defined  quantities,  the  required 
amount  of  latency  to  perform  a  given  application  is 
given  by  the  Effective  Processing  Time  which  is 


max 

k 


H  (cb  ■ 


CDj  c  PDj 


suppose  that  the  area  of  every  LSI  in  the  system  is 
equal  to  unit  area,  CDj  and  PDj  respectively  means 
the  maximum  number  of  Elemental  Interconnect  and 
Elemental  Computation  that  can  be  simultaneously 
done,  and  Cijtc  being  latency  between  LSI  i  and  j. 
Fig.  1  schematically  shows  the  definitions  of  the  nu¬ 
merical  indexes  that  indicate  the  potential  capability 
of  the  system. 


Algorithm  f 


Elemental  Computation 

Elemental  Interconnect 


LSI  if 


iLSI  A 
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l 

Possible  reduction  of  latency 

Fig.  2  The  Assignment  of  Loads  and  the  Definition 
of  Cost  Matrix. 


3  Comprehensive  Evaluation  of  Inter¬ 
connect  and  Computation 

As  shown  in  eq.  (1),  bottlenecks  could  happen  if  ex¬ 
cess  loads  of  interconnect  or  computation  are  assigned 
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Table  1  The  Definition  of  the  Cost  Matrix. 

For  every  combination  of  i  €  1,  •  *  • ,  N,  j  6  1,  •  •  • ,  N 
.  f  =  1  (0  <  A,  <  PDi) 


j  d2i-i,2i  =  1  (0  <  Ai  <  PDi 

\  d2i-i,2i  =  +  1  {Ai  >  PDi) 

d2i,2i-\  =  {Ai  >  0) 

(  d2i, 2j-l  —  cij  (rn1?  -  landO  <  1 
(  dti,2j- 1  =  C-!J  c Dj  {IDj  >  CDj) 


<  IDj  <  CDj) 
i ) 


4)  d2j  — 1 ,2i  =  “c«  cjD^  (mO  =  land/Dj  >  °) 

TFIv^I - 

d2i',2JV+l  =  d2IV  +  l,2i  =  0 

(If  the  task  assigned  to  LSI  t  is  the  final  step 

of  the  given  applications.) _ 

Other  elements  are  set  to  be  infinity. 


in  some  part  of  the  system.  Obtaining  efficient  com¬ 
putation  procedure  is  corresponding  to  finding  the 
best  distribution  of  interconnects  and  computations 
that  gives  the  shortest  path  between  nodes  depicted 
by  “Start”  and  “Result”  in  Fig.  1. 

In  order  to  systematically  derive  the  best  way, 
a  mathematical  framework  is  introduced,  which  is 
named  here  as  the  Cost  Matrix,  for  the  represen¬ 
tation  of  allocated  computational  loads  at  chips  and 
interconnections  in  between.  Fig.  2  shows  the  idea  of 
the  relationship  between  the  load  in  the  system  and 
the  element  of  the  Cost  Matrix.  For  an  Elemental 
Computation  assigned  at  LSI  i,  two  nodes  are  de¬ 
fined,  which  are  denoted  by  2t  —  1  and  2t,  and  the 
element  of  the  Cost  Matrix  is  given  as  the  weight  as¬ 
signed  to  the  arc  between  these  two  nodes,  namely 
^2i-i,2ti  on  the  basis  of  the  rule  given  in  Table  1.  As 
the  same  fashion,  the  Elemental  Interconnect  is  also 
evaluated. 

The  weight  allocated  to  reverse  paths  in  the  Cost 
Matrix,  which  is  corresponding  to  the  arc  from  2 i  to 
2i  —  1  and  from  2 j  —  1  to  2i  in  Fig.  2  take  a  critical  role 
to  obtain  the  optimum  algorithm,  by  which  possible 
reduction  of  the  latency,  if  the  task  is  cancelled  at  the 
corresponding  processing  node,  is  indicated. 

3.1  Example 

Suppose  a  system  having  three  chips,  each  of  which 
PD  and  CD  being  1.  The  connections  are  specified 
by  M  shown  in  Fig.  3  (b).  Let  all  of  the  Elemental 
Computation  being  allocated  at  LSI  2  whose  number 
is  7,  and  they  need  an  Elemental  Interconnect  from 
LSI  1.  Then,  the  Cost  Matrix  is  given  by  Fig.  3  (c). 
The  arc  between  node  i  and  j  in  Fig.  3(d)  represents 
the  (i,  j)-element  of  the  Cost  Matrix. 

4  Optimization  Method:  The  Nega¬ 
tive  Closed  Loop 

Since  the  elements  of  the  Cost  Matrix  could  have 
negative  values  based  on  the  definition  of  itself,  there 
may  be  closed  loops  along  which  the  summation  of 
the  elements  results  in  a  negative  value.  We  call  such 
loop  a  Negative  Closed  Loop. 

The  existence  of  Negative  Closed  Loops  indicates 
the  existence  of  more  efficient  algorithms  than  cur¬ 
rent  one,  by  re-allocating  some  portion  of  Elemen¬ 
tal  Interconnect  or  Elemental  Computation  such  that 
the  negative  value  decreases  along  the  closed  loop. 
When  the  algorithm  that  minimizes  eq.  (1)  is  ob¬ 
tained,  there  are  no  Negative  Closed  Loops,  which 
can  be  proofed  by  Network  Flow  theories[4]. 

In  the  example  case,  there  is  a  Negative  Closed 
Loop  along  nodes  @-®-(2)-©-®'©  whose  summa¬ 


tion  of  weight  yields  —5,  as  shown  in  Fig.  3(d).  If 
some  portion  of  the  Elemental  Computations  is  re¬ 
assigned  from  LSI1  to  LSI3  as  shown  in  Fig.  4,  then 
there  is  no  Negative  Closed  Loop,  which  yields  opti¬ 
mum  algorithm  for  this  problem. 


Number  of 
allocated 
Elemental 
Computation 


(b)  Interconnect  Matrix  (c)  Cost  Matrix 


Fig.  3  Algorithm  Formulation:  Initial  Status. 


Fig.  4  Final  Algorithm. 


5  Conclusion 

A  method  to  obtain  the  processing  and  communi¬ 
cation  loads  distribution  that  yields  minimum  pro¬ 
cessing  time  for  a  given  application  in  a  optically  in¬ 
terconnected  system  has  been  shown  considering  the 
properties  of  the  physical  layer  enhanced  by  optical 
technologies  on  the  basis  of  a  matrix-based  mathe¬ 
matical  framework. 

References 

1.  N.  McArdle  et  al.,  IEEE  JSTQE,  Vol.5,  No.2,  pp.250-260 
(1999) 

2.  The  National  Technology  Roadmap  for  Semiconductors, 
SI  A,  1999. 

3.  Technology  roadmap:  Optoelectronic  interconnects  for 
integrated  circuits,  European  Commission,  1998. 

4.  R.  K.Ahuja  et  al.,  NETWORK  FLOW,  Prentice  Hall, 
1993. 


48 


TuA2.4 

11:1 5am-1 1:30am 


Low-power  short-distance  optical  interconnect  using  imaging  fibre 

bundles  and  CMOS  detectors 

C.  Rooman(1'2),  M.  Kuijk(1),  D.  Filkins(3),  B.  Dutta(2),  R.  Windisch(2),  R.  Vounckx*11,  P.  Heremans<2) 

University  of  Brussels  V.U.B.,  department  ETRO,  Pleinlaan  2,  B1050  Brussels,  Belgium 
<2)  IMEC,  Kapeldreef  75,  B3001  Leuven,  Belgium,  tel  ++32-16-281751,  fax  ++32-16-281501,  email:  rooman@imec.be 
(3)  Schott  Fiber  Optics,  122  Charlton  Street,  Southbridge,  MA  01550-1960 


Introduction  and  Description  of  the  link 

We  report  on  the  measured  link  performance  of  a  novel  parallel  optical  link  that  is  to  operate  from  CMOS  chip  to  CMOS  chip. 
The  components  of  the  optical  link  are  shown  in  Figure  1  and  briefly  described  below. 

The  optical  channel  is  a  10-cm  long  imaging  fibre  bundle  with  1.9  mm  cross-section  and  open  area  ratio  of  67%.  The  bundle  is 
assembled  from  7  pm  fibres  with  numerical  aperture  of  1. 

The  light  sources  are  a  2-dimensional  array  of  10  X  10  high-speed  high-performance  light-emitting  diodes,  called  non-resonant 
cavity  LEDs  [1].  The  LEDs  have  a  diameter  of  30  pm  and  are  on  a  pitch  of  100  pm  in  X  and  Y.  The  LED  array  is  designed  to  be 
flip-chip  mounted  onto  CMOS  driver  circuits,  and  the  emission  wavelength  is  850  nm.  To  solve  the  problem  that  the  GaAs 
substrate  is  not  transparent  at  this  wavelength,  the  original  GaAs  substrate  is  replaced  by  a  fiber  optic  face  plate  [2],  with  square 
fibres  of  5.4  pm  with  numerical  aperture  of  1. 

Finally,  the  detectors  and 
receivers  in  the  system  are 
standard  CMOS  detector- 
receiver  cells,  again  arranged 
as  an  array  on  a  pitch  of  100 
pm  in  X  and  Y.  The  detectors, 
called  “Spatially  Modulated 
Light  Detector”  [3]  are 
designed  to  allow  high-speed 
operation.  Each  detector  is  50 
pm  X  50  pm,  and  its 
corresponding  integrated 
receiver  takes  the  remaining 
area  in  the  100  pm  X  100  pm 
cell.  The  detector/receiver 
CMOS  circuit  was  designed  in 
0.6  pm  CMOS,  and  fabbed  in  a 
foundry. 

Apart  from  the  fibre  optic  face  plate  that  serves  as  substrate  for  the  LED  array  and  the  imaging  fibre  bundle,  there  is  no  optics  in 
the  system.  In  particular,  there  is  no  magnification/demagnification  at  the  transition  between  the  source  array  and  the  imaging 
fibre  bundle,  and  no  coupling  optics  at  the  entrance  or  exit  of  the  imaging  fibre  bundle.  The  fibre  is  butt-coupled  to  the  opto¬ 
electronic  devices.  The  large  numerical  aperture  of  the  face  plate  and  the  fibre  bundle  allows  to  capture  and  guide  the  light  of  the 
LEDs  efficiently. 


WJWfffrcir  cmos  f eIiWp'IIb 

Figure  1:  Components  of  the  proposed  low-power  short  distance  optical  interconnect 


Measured  link  performance 

Figure  2  shows  the  open  eye  diagram  of  an  LED  of  the  array  (on  fibre  optic  face  plate)  transmitted  through  the  imaging  fibre 
bundle  at  1  Gbit/s.  The  measurement  was  done  using  a  pseudo-random  bit  stream  of  2  23  -  1  bits.  It  shows  that  the  switching 
speed  of  the  (small)  LEDs  used  in  this  work  allows  Gbit/s  optical  transmission. 

Figure  3  shows  the  digital  eye  diagram  of  the  optical  link  including  the  LED  on  face  plate,  the  imaging  fibre  bundle  and  the 
CMOS  detector/receiver,  at  300  Mbit/s.  We  checked  that  the  detector,  designed  in  0.6  pm  CMOS,  operates  at  bit  rates  up  to  600 
Mbit/s.  The  transmission  speed  of  the  system  is,  today,  limited  by  the  receiver  to  300  Mbit/s.  Simulations  show  that  a  re-design 
in  0.25  pm  should  crank  up  this  speed  to  over  1  Gbit/s  per  channel. 
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Figure  2  :  Eye  diagram  of  optical  pulses  Figure  3  :  Eye  diagram  of  the  system  at  300  Mbit/s 

transmitted  through  imaging  fibre  bundle 


Figure  4  shows  the  bit  error  rate  versus  optical  power  falling  on  the  CMOS  chip,  at  250  Mbit/s  and  300  Mbit/s,  using  a  223-l 
pseudo-random  bit  stream.  10  pW  average  power  (corresponding  to  -20  dbm)  is  seen  to  be  sufficient  at  300  Mbit/s  to  achieve  a 
bit  error  rate  of  less  than  1012. 

In  Table  1  is  summarised  the  measured  electrical  power  budget  of  the  link.  At  300  Mbit/s,  the  LED  is  driven  at  a  drive  current  of 
0.70  mA  peak-to-peak.  Since  the  CMOS  driver  circuit  will  draw  this  current  from  a  power  supply  of  3.3  V,  the  average  power 
dissipation  in  the  LED  +  driver  circuit  will  be  close  to  0.35  mA  X  3.3  V  =  1.15  mW.  The  power  consumption  of  the  CMOS 
detector/receiver  was  measured  to  be  2.8  mW.  The  total  power  consumption  of  a  single  link  is  therefore  only  4  mW.  In  a  parallel 
link  between  chips,  where  emitters  and  detector/receivers  are  evenly  distributed  over  both  chips,  each  optical  I/O  will  therefore 
consume  on  average  only  2  mW,  and  take  up  a  space  of  merely  100  pm  X  100  pm  (in  0.6  pm  CMOS,  and  scalable  in  newer 
CMOS  generations). 


Table  1  :  Power  budget  of  the  link 


Bit  rate 

required 

LED  curren 

LED  power 

Det  /  Rec 

avg.  power 

opticalpow. 

consumptio 

i  power 

consump 

over  3.3  V 

con  sump 

per  chip 

Mbit's 

pWatt 

mA 

mW 

mW 

mW 

250 

7 

0.22 

0.75 

2.8 

1.8 

300 

10 

0.35 

1.15 

2.6 

2.0 

average  optical  power  on  CMOS  chip  (pW) 

Figure  4  :  Bit  error  rate  vs.  average  optical  power 
for  the  link  operating  at  250  Mbit/s  and  300  Mbit/s 


The  power  consumption  and  the  density  of  the  demonstrated  optical  link  compares  very  favourably  to  competing  electrical  and 
optical  solutions.  For  example,  an  electrical  link  consumes  a  power  equal  to  CV2.  Here  C  consists  of  a  fixed  capacitance  of 
about  10  pF  plus  a  contribution  proportional  to  the  link  length  of  at  least  I  pF/cm.  For  10  cm,  the  capacitance  is  therefore  at  least 
20  pF.  Even  when  switching  over  only  1  V,  the  electrical  power  consumption  is  more  than  6  mWatt  at  300  Mbit/s.  When  lurther 
increasing  the  link  speed  or  length,  the  comparison  is  even  more  in  favour  of  the  optical  alternative. 
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ABSTRACT 

We  present  a  numerically  stable  and  computational  simple 
method  for  fast  and  robust  measurement  of  optical  channel 
gain.  By  transmitting  adaptively  designed  signals  through  the 
channel,  good  accuracy  is  possible  even  in  severe  noise  condi¬ 
tions. 

1.  INTRODUCTION 

The  measuring  of  optical  channel  gains  is  a  key  element  in 
many  applications.  Measuring  channel  gains  means  determin¬ 
ing  the  change  in  intensity  when  a  signal  is  transmitted  from 
an  emitter  to  a  receiver.  A  well-known  and  simple  applica¬ 
tion  is  an  automatic  door,  which  responds  whenever  a  person 
is  reflecting  the  emitted  signal,  and  thereby  significantly  in¬ 
creasing  the  channel  gain.  Another  example  is  measuring  the 
thickness  of  paper.  A  more  subtle  example  is  determination 
of  spatial  position  by  comparing  the  intensities  of  a  multitude 
of  reflections  from  a  single  object.  A  typical  way  of  making 
this  type  of  measurements  is  emitting  a  simple  signal,  such  as 
a  harmonic  or  square  wave  signal,  since  they  are  both  easily 
constructed  and  measured  with  analog  electronics.  Such  solu¬ 
tions  have  two  major  disadvantages:  The  signals  are  sensitive 
to  frequency  located  disturbances,  and  it  is  difficult  to  detect 
and  avoid/neutralize  such  disturbances. 

We  propose  a  measuring  method  which  is  highly  accurate 
in  moderate  noise  conditions,  and  less  accurate,  but  very  ro¬ 
bust,  in  severe  noise  conditions.  This  is  achieved  by  using 
two  closely  related  digital  signal  design  algorithms;  a  “best 
case”  and  a  “worst  case”  algorithm.  The  former  is  based  on 
the  wavelet  transform  (WT),  while  the  latter  is  based  on  the 
Rudin-Shapiro  transform  (RST).  They  are  both  simple,  numer¬ 
ically  stable,  and  post-processing  friendly  making  them  ideal 
for  implementation  e.g.  in  a  fixed  point  DSR  By  introducing  a 
signal  processor  it  becomes  possible  to  continuously  redesign 
the  signals  for  improved  SNR,  and  thereby  maintaining  the  ac¬ 
curacy  in  changing  and/or  severe  noise  conditions. 

2.  DESIGNING  THE  DIGITAL  SIGNALS 

The  two  design  algorithms  are  based  on  the  wavelet  packet 
transform  scheme;  it  is  fast,  numerically  stable,  works  well 
in  fixed  point  arithmetics,  and  has  low  program  complexity. 

This  work  is  in  part  supported  by  the  Danish  Technical  Science 
Foundation  Grant  no.  9701481. 

Patent  pending. 


The  best  case  algorithm  uses  the  classical  WT  to  create  sig¬ 
nals  which  are  near-orthogonal  to  expected  noise  occurrences, 
while  the  other  algorithm  uses  the  RST  to  create  an  all-spectrum 
signal,  which  by  nature  has  low  sensitivity  with  respect  to  time 
and  frequency  located  noise  occurrences.  The  difference  is 
essentially  that  the  WT  algorithm  “searches  for  holes”  in  the 
current  noise,  while  the  RST  algorithm  spreads  information  in 
time  and  frequency  to  reduce  the  impact  of  localized  distur¬ 
bances.  The  prefered  method  depends  on  the  noise  conditions. 
If  there  are  easy-to-find  holes  in  the  noise,  the  former  can  pro¬ 
vide  very  accurate  measurements.  If,  however,  the  noise  is  dif¬ 
ficult  to  define  or  is  changing  rapidly,  the  latter  method  pro¬ 
vides  lesser  accurate,  but  more  robust  measurements. 

A  good  introduction  to  the  wavelet  theory  is  Wickerhauser 
[6].  A  mathematically  rigorous  treatment  of  the  subject  is  given 
in  Daubechies  [3],  For  more  material  on  Rudin-Shapiro  poly¬ 
nomials  see  Brillhart  [1] 

2.1.  The  wavelet  transform 

The  WT  based  algorithm  takes  a  simple,  time  localized  sig¬ 
nal  (see  figure  1  for  an  example),  and  inversely  wavelet  packet 
transform  it,  which  results  in  a  frequency  localized  signal.  A 
typical  maximum  spread  is  also  shown  in  figure  1 .  After  trans¬ 
mission  the  signal  is  forwardly  transformed  to  reproduce  the 
original,  now  noisy  simple  signal,  and  by  inner  product  with 
the  “clean”  original  signal  the  transmission  intensity  (the  chan¬ 
nel  gain)  is  determined.  Since  the  original  signal  is  completely 
known,  it  is  also  possible  to  obtain  an  estimated  accuracy  of 
the  channel  gain  measurement.  This  is  accomplished  by  taking 
inner  product  between  the  transmitted,  transformed  signal  and 
a  number  of  signals  orthogonal  to  the  original  signal.  If  these 
quantities  are  small  the  transmission  was  most  likely  subject  to 
only  mild  noise.  This  trick  provides  an  easily  calculated  guide¬ 
line  to  how  much  one  can  trust  the  current  measurement.  If 
each  measurement  is  vital  a  number  of  signal  restoration  pro¬ 
cedures  (not  further  described  here)  can  be  applied.  These  also 
benefit  from  the  complete  knowledge  of  the  original  signal. 
Note  that  the  small  number  of  non-vanishing  coefficients  of 
the  original  signal  in  all  cases  significantly  reduce  the  amount 
of  calculations. 

Because  the  transform  is  linear  and  has  perfect  reconstruc¬ 
tion,  it  is  also  easy  to  make  a  good  sample  by  sample  estimation 
of  the  noise,  which  provides  valuable  information  on  any  dis¬ 
turbances.  This  makes  it  possible  to  do  real  time  adaptation 
of  the  signal.  The  mehtods  combines  the  ability  of  the  WT 
to  produce  predefined  trade-offs  between  time  and  frequency 
information  with  the  freedom  in  design  of  the  original  signal. 
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Figure  1:  Uppermost  a  simple  signal  (here  a  sampled  chirp). 
Below  (in  solid)  the  absolute  value  of  the  Fourier  transform 
of  the  3-scale  inverse  wavelet  packet  transform  with  symlcts 
6  [4],  The  dashed  curve  shows  the  maximal  frequency  spread¬ 
ing  for  any  (suitably  normalized)  signal  with  coefficients  van¬ 
ishing  outside  [128;  159], 


Thereby  it  is  possible  to  adapt  the  method  to  virtually  any  type 
of  noise,  in  particular  disturbances  with  large  temporal  extent. 

2.2.  The  Rudin-Shapiro  transform 

The  RST  is  defined  through  a  slightly  extended  version  of  the 
remarkable  Rudin-Shapiro  polynomials,  introduced  in  1951  by 
H.  S.  Shapiro  in  his  master’s  thesis,  and  published  in  1959  by 
Rudin  [5].  Define  the  polynimals 

•Pm+ 1(*)  =  Pm(x)  +  P0  =  1 

Qm+i(s)  =  Pm(z)-(-i)Smx2mQm(z),  <?c  =  i  n) 

with  Sm  g  {0, 1.}.  It  immediate  follows  that  for  all  |-|  =  1 
|Pm+lj2  +  |<?m+1  T  =  2|Pm|2  +  2|Q„,|2  =  2m+2. 
Consequently, 

max  |Pn»(«s**)|  <  v/2||Pm(e’?)||2,  (2) 

guaranteeing  a  certain  flatness  of  tire  polynomials,  A  construc¬ 
tion  similar  to  (1)  is  found  in  Byrnes  [2],  The  coefficients 
of  the  polynomials  can  also  be  constructed  with  the  Rudin- 
Shapiro  transform,  which  is  really  a  modified  wavelet  packet 
Haar  transform.  Define  the  unitary  transform  H„  :  R:2"  i-> 
R2",  r*  >  1,  as 


y*  1 

~  —  \l 

(— i)fc  r 

Vfc+2"-lJ 

~  vs  L1 

— ( — i)*j 

for  k  =  0, . . .  ,  2"  1  —  1  when  mapping  x  to  y.  Then,  with 
H„  being  the  inverse  of  H„, 

H=n 

n«l 

transforms  the  canonical  basis  for  R2*"  into  the  coefficients  of 
the  2n  possible  Pk(z)' s.  Hence  these  coefficients  constitutes 
an  orthonormal  basis,  which  not  only  consists  of  only  ±1,  but 


H„ 


0 


0 


H„ 


Figure  2:  Uppermost  is  the  result  of  applying  the  inverse 
Rudin-Shapiro  transform  to  the  29th  canonical  basis  vector  of 
length  64.  Below  the  frequency  response  (the  absolute  value  of 
the  corresponding  Pe(el()  polynomial). 


also,  due  to  (2),  has  a  remarkable  frequency  response.  In  fig¬ 
ure  2  is  an  example  of  such  a  basis  element  and  its  frequency 
response.  The  RST  based  algorithm  is  applied  in  much  the 
same  way  as  the  WT  method.  A  simple  signal  is  inversely 
transformed  prior  to  transmission.  This  will  produce  an  all- 
spectruni  signal.  Upon  transmission  the  signal  is  forwardly 
transformed  yielding  the  original,  simple  signal  with  noise. 
The  post-processing  is  equivalent  to  that  of  the  WT  method. 

3.  CONCLUSION 

Two  computationally  simple  and  numerically  robust  algorithms 
for  measuring  optical  channel  gains  were  presented.  One  of 
the  algorithms  provides  excellent  accuracy  in  moderate  noise 
conditions,  while  the  other  has  reduced,  but  very  robust,  ac¬ 
curacy  even  in  severe  noise  conditions.  Combining  the  two 
algorithms,  either  by  applying  the  most  suitable  one,  or  jointly 
in  two  parallel  systems,  is  easy  due  to  their  similar  program 
and  computational  structure,  3nd  the  result  is  a  versatile  op¬ 
tical  channel  g3in  method.  The  low  complexity  and  numer¬ 
ical  stability  of  the  wavelet  packet  transform  scheme  and  of 
the  post-processing  (mainly  inner  products)  also  makes  this  ap¬ 
proach  fast  and  suitable  for  low  cost  hardware  implementation. 
For  the  commercial  aspects  of  these  methods,  please  refer  to 
www .  beamcxontrol .  com. 
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What  can  be  perceived  by  a  human  or  computed  by  a  machine  from  an  image  is 
fundamentally  restricted  by  the  captured  data.  Current  imaging  systems  are  limited  in 
spatial  resolution,  field  of  view,  and  dynamic  range.  In  this  talk,  we  present  new  vision 
sensors  that  provide  unconventional  forms  of  visual  information.  The  first  part  of  the  talk 
focuses  on  the  use  of  catadioptrics  (lenses  and  mirrors)  for  capturing  unusually  large 
fields  of  view.  We  describe  several  methods  for  obtaining  single  viewpoint  and  multi¬ 
viewpoint  images.  The  second  part  of  the  talk  addresses  the  problem  of  acquiring  high 
dynamic  range  images  using  a  low  dynamic  range  detector.  We  present  two  approaches 
for  extracting  the  desired  extra  bits  at  each  pixel;  the  first  one  uses  multiple  images  while 
the  second  uses  just  a  single  image.  Several  interactive  demonstrations  of  our  results  will 
be  shown.  These  results  have  implications  for  digital  imaging,  immersive  imaging,  image 
based  rendering,  3D  scene  modeling,  and  advanced  interfaces. 
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Multispectral  imagers  have  been  widely  investigated  for  their  application  to  target  recognition, 
particularly  in  clutter  [1,2],  industrial  gas  pollutant  monitoring  [3],  and  biomedical  imaging  [4],  For 
example,  target  recognition  is  a  difficult  task  with  passive  detection  of  small  targets  embedded  in  infrared 
image  data.  This  difficulty  arises  because  in  most  cases  the  target  exhibits  only  slight  variation  in 
temperature  from  the  background  clutter.  When  image  data  at  multiple  wavelengths  is  collected,  the  data 
can  be  processed  to  cancel  the  background  clutter  and  improve  the  detectability  of  targets  [1,2].  Many 
methods  have  been  used  to  realize  multispectral  imaging,  however,  each  has  some  significant  difficulties 
[3,4]  which  can  be  resolved  by  using  co-located  detectors  stacked  in  3D  on  top  of  image  processing 
circuitry  [5,6,7], 

Information  processing  using  digital  Si  CMOS  circuits  is  currently  ubiquitous  due  to  the 
increasing  level  of  complexity  of  information  processing  demands  and  capabilities.  Leveraging  the  high 
level  of  Si  signal  processing  complexity  by  creating  “smart”  systems  that  use  Si  circuitry  with  enhanced 
added  functions  is  an  attractive  system  option.  The  most  effective  value-added  components  to  add  to  Si 
CMOS  are  those  that  can  be  independently  optimized,  compromising  neither  the  performance  of  the 
value-added  devices  nor  the  Si  CMOS  circuits.  Herein,  we  discuss  the  design  of  a  “smart”  multispectral 
imaging  system  which  can  be  fabricated  with  independently  optimized  detector  arrays  and  Si  CMOS 
circuits,  which  are  stacked  in  three  dimensions  on  top  of  one  another  for  complete  physical  registration  of 
the  multispectral  image.  This  smart  multispectral  imager,  described  herein,  utilizes  the  heterogeneous 
integration  of  thin  film  GaN  (UV),  MCT  (MWIR),  and  Si  CMOS  circuitry  to  create  an  outstanding 
example  of  the  power  of  heterogeneous  integration  for  enhancing  optical  systems  using  electronic  signal 
processing  capabilities. 

The  smart  multispectral  imager  realizes  the  full  registration  of  multicolor  images  through  the 
integration  of  multiple  stacked  co-located  detectors  onto  Si  CMOS  analog  to  digital  converter  (ADC) 
circuitry,  as  illustrated  (not  to  scale)  in  Figure  1.  A  foundry  digital  Si  CMOS  array  of  sigma  delta  ADCs, 
shown  in  a  photomicrograph  in  Figure  2,  serves  as  the  integration  host  substrate.  These  Si  CMOS  ADCs 
consist  of  one  dedicated  converter  per  image  pixel,  and  the  circuits  have  been  optimized  for  low  noise 
transfer  of  data  off  of  the  focal  plane,  using  per  pixel  digitization  and  filtering  methods  which  can  also  be 
used  to  trade  off  frame  rate  and  resolution.  The  GaN  UV  MSM  and  MCT  MWIR  pn  photodetectors  are 
independently  grown  and  optimized,  and  are  designed  for  thin  film  processing,  integration,  and  stacking 
using  metal/metal  bonding.  This  process  is  illustrated  in  Figure  3. 


Figure  1 .  Schematic  of  integrated  multispectral 
imaging  array  using  3D  stacked  detectors. 


Figure  2.  Photomicrograph  of  Si  CMOS 
ADC  circuit  fabricated  through  MOSIS. 
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In  this  integration  process,  a  stop  etch  layer  is 
grown  lattice  matched  between  the  detector  epilayers 
and  the  growth  substrate  (a).  This  stop  etch  layer  can  be 
removed,  and  does  not  degrade  device  performance. 
The  devices  are  mesa  etched  for  pixellization  (a  partial 
mesa  etch  for  arrays,  to  maintain  spatial  registration 
during  subsequent  bonding  processes)  (b),  and  the 
devices  are  protected  from  the  substrate  removal  etchant 
by  a  handling  layer  such  as  Apeizon  W  (c).  The 
substrate  is  removed  using  a  wet  etch  which  selectively 
etches  the  substrate  and  not  the  stop  etch  layer  (d),  and 
the  stop  etch  layer,  with  another  etchant,  can  then  be 
removed  if  necessary.  The  devices  are  then  bonded  to  a 
transfer  diaphragm  (e,  f),  and  are  aligned  and  bonded  to 
the  host  substrate  (g)  [5,6], 

To  realize  these  thin  film  photodetectors,  the  MCT  MWIR  pn  detectors  are  grown  on  SOI,  and 
the  GaN  UV  photodetectors  are  grown  on  lithium  gallate.  These  structures  enable  the  selective  chemical 
removal  of  the  growth  substrate,  as  illustrated  in  Figure  3,  with  details  in  the  presentation.  To  integrate  the 
Si  CMOS  host  substrate  circuit,  it  is  planarized  and  isolated  with  polyimide,  cured,  vias  are  cut  to  the  Si 
CMOS  contact  pads  using  a  reactive  ion  etcher  (RIE),  and  the  vias  are  metallized  for  contact  to  the  MCT 
(bottom  layer)  photodetectors.  Note  that  the  original  top  metallization  of  the  photodetectors  is  inverted  in 
the  transfer  process,  and  the  devices  are  thus  metal/metal  bonded  to  the  Si  CMOS  circuit,  resulting  in  an 
excellent  mechanical  and  electrical  bond.  As  an  example  of  this  type  of  thin  film  detector  array 
integration,  Figure  4  is  a  photomicrograph  of  an  8x8  array  of  thin  film  GaAs  P-i-N  detectors  bonded 
directly  on  top  of  Si  CMOS  circuits  [7]. 


(a)  (b)  (c)  (d) 

Figure  4.  Photomicrographs  of  8x8  integrated  array:  (a)  Si  CMOS  circuit;  (b)  thin  film  P-i-N  detectors  bonded  to 
planarized/via  cut/metallized  circuit;  (c)  interstitial  layer  removed;  (d)  planarization/isolation,  top  via,  and  top  metal 
contact  completed. 

Since  the  MCT  MWIR  detectors  are  only  microns  thick,  they  can  be  planarized  with  polyimide  or 
BCB  with  subsequent  top  contact  metallization  (which  can  be  utilized  as  a  shared  contact  with  the  GaN). 
To  fully  separate  the  MCT  and  GaN  contacts,  the  MCT  top  contacts  can  be  isolated  with  polyimide  and 
the  top  MSM  contact  metallization  for  the  GaN  UV  photodetector  array  deposited.  Next,  the  GaN  thin 
film  MSM  detectors  are  bonded,  with  contacts  facing  down  to  the  metallized  MCT  array,  thus  completing 
the  imaging  array.  Processing  details  and  projected  performance  of  the  integrated  multispectral  imaging 
array  will  be  discussed  in  the  presentation. 
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Abstract 

A  photoreceiver  array  is  presented  that  exploits  the  features  of  near  field  optics  to  provide  sub-nanometer  spatial 
resolution.  Comprised  of  a  photonic  CMOS  photoreceiver  chip  and  a  MEMS  based  aperture  array,  the  near-field 
photoreceiver  array  is  implemented  using  standard  fabrication  techniques. 

Summary 

It  has  been  suggested  that  ultra  high-density  optical  storage  devices  could  be  realized  by  storing  data  in  patterns  with 
spatial  coordinates  below  the  far-field  resolution  limit.[l]  While  the  ability  to  write  data  on  these  fine  dimensions 
has  been  shown,  the  ability  to  read  data  with  sub-lambda  spatial  resolution  has  proven  problematic.  This  is 
especially  true  for  memory  systems  that  require  page-oriented  memory  access. 

In  this  paper  we  describe  a  novel  photoreceiver  array  technology  that  exploits  the  features  of  near  field  optics[2]  to 
provide  sub-nanometer  resolution  capabilities.  Specifically,  each  optical  input  window  in  the  array  uses  a  sub¬ 
wavelength  aperture  to  diffract  evanescent  waves  into  the  silicon  based  detector  of  a  high  gain  photoreceiver  circuit. 
With  this  aperture  structure,  it  is  possible  to  transform  evanescent  waves  with  high  spatial  frequencies  (on  the  order 
of  the  wave  number  of  the  evanescent  waves)  into  lower  frequency  waves  that  propagate  (with  out  evanescent 
decay)  into  the  photodetector  structure.[2] 

Implemented  with  a  photonic  CMOS  photoreceiver  array  and  a  silicon  micro  machined  aperture  array,  our  approach 
is  based  on  standard  fabrication  processes.  The  near-field  photoreceiver  array  consists  of  a  5x5  CMOS  based 
photoreceiver  array.  Each  individual  photoreceiver  module  consists  of  a  silicon  based  photodetector,  a  multistage 
current  amplifier  and  a  thresholding  circuit  that  allows  the  photoreceiver  sensitivity  to  be  tuned  over  a  wide  range  of 
optical  power  levels.[3]  As  suggested  by  the  circuit  diagram  shown  in  figure  1,  the  entire  photoreceiver  module 
(including  the  PNP  phototransistor)  is  based  on  a  conventional  CMOS  device  technology.  Our  current  version  of 
the  photoreceiver  chip  has  been  fabricated  through  the  MOSIS  foundry  service  using  a  1.2  |tm  CMOS  device 
technology. 


Figure  1  .Circuit  diagram  for  the  CMOS  based  photoreceiver  module.  The  phototransistor  is  implemented 

with  a  p+  emitter  region  implant  in  an  N-well  base  region.  The  p-type  substrate  acts  as  the  collector 
region  in  the  resulting  pnp  transistor  structure. 
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Figure  2  Cross  section  of  the  near  field  photoreceiver  array  showing  the  photoreceiver  chip  and  the 
micro-machined  aperture  array. 

While  the  photoreceiver  circuit  is  an  important  component  of  our  device  technology,  the  aperture  array  is  the 
fundamental  component  designed  to  enable  data  detection  with  near-field  resolution.  Using  micro-machining 
technology  pioneered  for  fabrication  of  micro  electro-mechanical  systems  (MEMS),  the  aperture  array  is  realized  by 
etching  Silicon  with  a  KOH/IPA  etching  solution.  This  anisotropic  etch  step  is  used  to  machine  via  like  structures 
through  to  a  Si02  film  on  the  back  side  of  a  thinned  Silicon  substrate.  Finally,  sub-micron  sized  apertures  are  milled 
through  the  Si02  film  using  a  commercial  focused  ion  beam  milling  system. 

After  independent  fabrication  of  the  photoreceiver  array  and  the  aperture  array,  the  near-field  photoreceiver  array  is 
completed  by  mechanically  joining  the  two  components.  Figure  2  shows  a  composite  view  of  the  completed 
structure.  As  indicated  in  the  figure,  mechanical  alignment  features  must  be  included  on  both  the  CMOS  chip  and 
the  aperture  array  to  ensure  proper  x/y  alignment  of  the  aperture  array  with  respect  to  the  photodetectors  in  the 
photoreceiver  array.  For  the  aperture  chip,  anisotropic  etching  is  used  to  define  a  set  of  4  posts  on  the  top  surface  of 
the  aperture  array.  Due  to  the  fast  off  axis  etching  of  pyramidal  comer  feature  in  Silicon,  proper  etching  of  the  post 
structures  requires  the  inclusion  of  comer  compensation  techniques[4]  in  the  photolithography  mask  design.  For  the 
CMOS  chip,  pit  features  are  etched  into  the  CMOS  photoreceiver  chip  using  a  post  fabrication,  anisotropic  etching 
procedure  on  the  packaged  CMOS  chips.  Note  that  fabrication  of  these  features  requires  the  inclusion  of  openings 
through  to  the  CMOS  chip  substrate.  These  “Open”  features  have  been  defined  in  the  VLSI  CAD  tool  as  a 
composite  layer  comprised  of  existing  oxide  via  layers. [5] 

In  this  paper  we  present  a  detailed  description  of  both  the  photoreceiver  circuit  and  the  aperture  array  fabrication 
method.  Independent  characterization  of  both  the  photoreceiver  circuit  and  the  aperture  array  will  be  included. 
Finally,  the  individual  components  are  packaged  together  and  the  expected  near-field  resolution  is  quantified. 
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Introduction 

Blindness  robs  millions  of  individuals  of  the  keen  sense  of  vision,  leaving  them  in  perpetual  darkness.  Several 
groups  around  the  world  are  evaluating  the  feasibility  of  creating  visual  perception  by  electrical  stimulation  of  the 
remaining  retinal  neurons  in  patients  blinded  by  photoreceptor  loss  [1-5].  Artificial  vision  as  provided  by  the 
prosthetic  system  of  a  multiple-unit  artificial  retina  chipset  (MARC)  implanted  within  the  eye  holds  a  promise  to 
restore  useful  vision  to  a  large  subset  of  patients  who  are  blind  from  retinal  disorders.  The  system  would  have  an 
external  camera  (mounted  in  a  glasses  frame)  to  acquire  an  image  and  convert  it  into  an  electrical  signal.  This  signal 
would  be  wirelessly  transmitted  to  an  implanted  chip,  which  would  electrically  stimulate  the  non-degenerated  cells 
of  the  retina.  The  pattern  of  electrical  stimulation  would  be  controlled  to  produce  the  perception  of  an  image. 
Hence,  the  sense  of  vision  could  be  partially  restored  with  such  an  implantable  electronic  device. 

The  human  eye  with  its  nearly  100  million  photoreceptors  is  a  complex  system  that  allows  us  to  enjoy  high 
resolution  imagery  full  of  colors.  The  system  functions  when  light  is  focused  by  the  cornea  and  crystalline  lens 
within  the  eye  on  to  the  retina,  a  delicate  tissue  that  lines  the  inside  of  the  eye  ball.  The  retina  is  highly  structured, 
composed  of  multiple  cell  layers,  each  with  a  specific  function.  The  photoreceptor  cell  layer  captures  light  energy 
and  converts  it  into  electrochemical  signals.  These  signals  then  excite  bipolar  cells  located  in  the  second  layer  of 
neurons  in  the  retina.  The  bipolar  cells  subsequently  transmit  the  signal  to  retinal  ganglion  cells  whose  axons 
collectively  form  the  optic  nerve  that  connects  the  retina  to  the  visual  centers  of  the  brain.  Other  cells  such  as  the 
horizontal  and  amacrine  cells  form  connections  between  neurons  within  a  layer,  adding  to  the  computational  power 
of  the  retinal  network  of  neurons.  This  biological  neural  network  is  a  sophisticated  image  processor,  capable  of 
compressing  information  from  100  million  photoreceptors  into  1  million  retinal  ganglion  cells. 

Patients  blind  from  retinitis  pigmentosa  (RP)  and  age-related  macular  degeneration  (AMD),  2  prevalent  retinal 
disorders,  could  regain  vision  with  MARC  is  developed.  AMD  is  the  leading  cause  of  blindness  in  individuals  60 
years  or  older,  with  200,000  eyes  left  legally  blind  each  year.  RP  has  an  incidence  of  1  in  4,000  and  in  the  US  alone 
afflicts  100,000  people.  Currently,  no  treatment  exists  for  either  RP  or  AMD.  We  have  shown  that  in  these  diseases 
despite  near  total  loss  of  the  photoreceptors,  the  remaining  retinal  neurons  remain  intact.  Is  there  a  way  to  jump- 
start  these  remaining  awaiting  neurons  and  thus  create  the  sensation  of  vision? 


Experiments 

In  15  blind  human  volunteers  tested,  we  have  shown  that  the  remaining  neurons  in  a  non-seeing  eye  can  be  activated 
by  an  electrical  signal,  resulting  in  the  perception  of  light.  The  human  tests  were  performed  in  an  operating  room, 
where,  under  local  anesthesia,  different  custom-built  stimulating  microelectrode  arrays  were  placed  into  the  eye  of  a 
blind  subject  and  positioned  near  the  retina.  Then  patterns  of  electrical  signals,  generated  by  an  external  computer, 
were  applied  to  the  retina  via  the  electrode  arrays.  The  subjects  were  asked  to  describe  the  resultant  perceptions.  At 
first,  we  were  concerned  and  unsure  whether  we  could  create  one  small  spot  of  light  that  would  correspond  to  the 
retinal  area  being  stimulated.  Once  we  were  able  to  get  patients  to  see  one  discrete  spot  of  light  localized  to  the 
retinal  area  being  stimulated,  the  next  question  that  needed  to  be  answered  was  could  we  create  an  image  by  eliciting 
the  perception  of  multiple  spots  of  light  (i.e.,  akin  to  how  a  dot-matrix  printer  creates  images)?  Could  the  brain  make 
sense  of  more  than  one  artificial  signal  at  a  time  or  would  the  spots  of  light  merge  into  an  incoherent  blur?  Using  a 
25-electrode  array,  we  stimulated  at  first  rows  and  columns  of  electrodes  and  then  patterns  of  electrodes  outlining  a 
large  letter  or  simple  geometric  shapes  such  as  a  rectangle.  In  all  our  experiments  it  took  several  minutes  before  the 
patient  could- confidently  identify  the  artificially  created  spot  of  light.  Once  they  recognized  the  first  dot  of  light 
they  made  quick  progress.  As  expected,  the  25-electrode  array  provided  poor  resolution,  however,  it  was  sufficient 
for  all  the  patients  tested  to  date  to  see  forms  such  as  a  large  letter  and  simple  geometric  shape. 
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Progress  Results 

Our  human  tests  have  also  demonstrated  that  the  artificial  vision  created  by  controlled  electrical  stimulation  of  the 
retina  has  color.  Patients  have  seen  yellow,  green  and  blue  dots  of  light  although  as  of  yet  we  do  not  have  a  firm 
understanding  on  how  to  reliably  elicit  seeing  one  color  over  another.  We  could  also  set  the  stimulation  frequency 
at  a  sufficiently  high  rate  so  that  the  perceptions  would  not  flicker  like  a  strobe  light  but  instead  appear  continuous. 
Our  results  from  tests  in  blind  human  volunteers  demonstrate  that  form  vision  is  possible  using  controlled  pattern 
electrical  stimulation  of  the  retina.  In  order  to  determine  the  degree  of  visual  acuity  that  is  possible  with  such  an 
approach,  an  implantable  MARC  is  currently  being  developed  so  that  the  patient  can  become  accustomed  to 
processing  a  new  type  of  visual  input. 

Concomitant  with  the  human  tests,  we  have  been  engineering  a  MARC,  a  completely  implantable  system  capable  of 
creating  hundreds  of  individual  spots  of  light.  Specifically,  we  had  to  develop  a  chip  that  would  be  small  enough  to 
be  placed  within  the  eye  and  yet  could  stimulate  the  retina  through  a  large  number  of  electrodes.  The  retinal 
stimulation  pulse  has  to  excite  the  neurons  but  not  damage  them.  To  date,  we  have  developed  a  chip  that  is  4x4  mm 
and  can  stimulate  100  electrodes  with  a  safe  and  effective  charge.  This  represents  a  first  generation  chip  and  the 
design  of  a  625-channel  stimulator  is  progressing.  Other  system  components  are  also  under  development.  A 
prototype  image  acquisition  using  miniature  CMOS  camera  and  image  processing  system  using  a  VLSI  image 
processing  board  has  been  built.  Wireless  communication  electronics  using  a  radio  frequency  link  has  been 
developed  and  it  can  supply  the  required  data  transmission  rate  and  power  coupling.  Stimulating  electrode  materials 
are  being  tested  to  assess  their  ability  to  withstand  corrosion  within  the  eye. 

Of  equivalent  importance  to  the  engineering  development  of  the  device  is  the  biological  testing  that  will  show  that  a 
retinal  implant  is  well  tolerated  in  the  eye.  Although  intraocular  retinal  surgical  procedures  are  commonly 
performed  there  is  no  precedence  to  attaching  a  device  to  the  delicate,  paper-thin  retina.  The  device  could  tear  the 
retina,  cause  intraocular  bleeding,  or  dislodge  because  of  eye  movements.  We  have  used  metal  alloy  tacks  approved 
by  the  FDA  for  intraocular  use  to  secure  the  implant.  In  animal  experiments  to  date,  the  implants  have  remained 
affixed  to  the  retina  through  1 1  months  and  caused  no  structural  of  functional  damage.  Experiments  in  progress  will 
examine  the  effects  of  electrical  stimulation  on  the  retina. 

Conclusion 

The  artificial  retinal  prosthesis  is  a  complex  project,  bridging  the  fields  of  optics,  microelectronics,  retinal  surgery, 
electrophysiology,  and  psychophysics.  Our  team  consists  of  medical  doctors,  scientists,  and  engineers  each  of  who 
brings  a  unique  and  necessary  experience  that  will  enable  the  development  of  the  retinal  prosthesis  system.  The 
initial  human  tests  and  engineering  developments  have  generated  a  great  deal  of  excitement  not  only  in  the  scientific 
community  but  also  in  the  mainstream  media.  Moreover,  because  of  the  quality  of  our  research  team  and  the  decade 
of  work  that  we  have  invested  developing  MARC,  we  believe  that  the  first  prototype  human  implant  MARC  will  be 
available  in  2-3  years.  And  while  the  implantation  of  a  retinal  prosthesis  would  be  a  significant  achievement,  it 
would  mark  only  the  start  of  more  research.  Expected  advances  in  microelectronics  will  allow  more  sophisticated 
implants.  Image  processing  algorithms  will  enhance  the  electronically  created  perception.  The  implications  of  this 
research  may  extend  beyond  this  immediate  project,  as  contributions  are  made  to  the  overall  field  of  chronically 
implanted  prosthetic  devices,  telemetric  monitoring  and  control,  and  hermetic  packaging.  The  observations  and 
clinical  and  engineering  experiments  performed  should  lend  insight  into  the  actual  functioning  of  the  human  retina. 
The  feedback  gained  by  these  studies  should  provide  a  vehicle  for  further  understanding  of  the 
retinal/vision/perception  process . 
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OPTICS  COUPLED  WITH  ELECTRONICS  ENABLES  IMAGING 
SYSTEMS  WITH  PREVOUSLY  IMPOSSIBLE  PERFORMANCE 

W.  Thomas  Cathey*+,  Edward  R.  Dowski\  Gregory  E.  Johnson**,  Robert  H.  Cormack+,  and 
Hans  B.  Wach* 

The  potential  advantages  of  jointly  designing  a  combined  optical/digital  imaging  system 
have  long  been  known. 1  Only  recently,  however,  have  several  demonstrations  of  this  been  done. 
These  include  extending  the  depth  of  focus  of  an  imaging  system,2'34  aberration  invariance,5'6 
passive  ranging,7'8  wide-spectrum  imaging,  and  high-performance  single-lens  imaging.  Progress 
in  designing  special  optical/digital  imaging  systems  should  greatly  accelerate  with  the  advent  of 
new  design  software. 

Progress  in  many  applications  where  misfocus  and  a  range  of  focus  is  involved  was 
stimulated  by  the  recognition  that  the  optical  transfer  function  (OTF)  of  an  imaging  system  can 
be  seen  over  a  region  of  misfocus  in  a  single  plot  that  is  mathematically  equivalent  to 
Woodward’s  ambiguity  function.9  10  Woodward’s  function  can  be  used  to  easily  visualize  the 
OTF  of  a  one-dimensional  imaging  system  over  a  large  range  of  misfocus  values.  This  has 
allowed  the  mathematical  lore  that  was  developed  for  radar  to  be  used  in  the  analysis  and  design 
of  imaging  systems  that  use  or  discard  misfocus  information.  The  extension  of  the  depth  of 
focus  of  an  imaging  system  is  an  example  where  the  misfocus  or  depth  information  is 
discarded.2'3,4  In  this  case,  the  optical  portion  of  the  system  is  modified  so  that  the  OTF  of  the 
image  acquisition  system  (the  optics)  changes  very  little  with  misfocus.  So,  rather  than  having 
drastically  changing  OTFs  with  nulls  at  different  locations  for  different  portions  of  an  extended 
object,  the  OTF  essentially  does  not  change  and  has  no  nulls.  The  output  of  the  optical  portion 
of  the  system,  the  “intermediate  image”,  is  detected  and  processed  to  obtain  the  final  image.  The 
digital  processing  is  object  independent  and,  because  the  OTF  is  unchanging,  independent  of 
object  location.  The  image  resolution  is  unaffected.  However,  the  resulting  OTF  has  spatial 
frequency  regions  where  the  response  is  lower  than  the  in-focus  unaltered  system. 
Consequently,  there  is  a  signal-to-noise  (S/N)  loss  in  the  final  image  of  an  extended-depth-of- 
focus  (EDF)  imaging  system  in  comparison  to  a  conventional  in-focus  image.  The  S/N  loss  is 
proportional  to  the  increase  in  the  depth  of  field. 

EDF  imaging  systems  are  invariant  to  misfocus-type  aberrations  such  as  lateral  chromatic 
aberration  and  curvature  of  field.  For  example,  if  red,  blue,  and  green  focus  in  different  planes, 
and  each  focus  is  extended,  the  three  focal  planes  become  thick  blocks,  and  overlap.  Hence, 
there  is  an  overlap  region  where  all  three  colors  are  in  focus.5,6  Similarly,  a  curved  thin  focal 
surface  in  curvature  becomes  a  thick  arc  into  which  a  flat  detector  array  can  be  fitted. 

Hybrid  optical/digital  imaging  systems  with  an  extended  depth  of  field  have  been  used  in 
image-based  barcode  readers,  label  readers,  and  in  microscopes.11,12 

Two  types  of  passive  ranging  systems  have  been  developed.  One  is  based  on  range 
estimation  and  the  modification  of  the  optical  portion  of  the  system  to  have  an  OTF  with 
periodically  spaced  zeros.7  The  period  of  these  zeros  changes  with  misfocus,  and  the  application 
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of  spectral  estimation  to  a  cluster  of  pixels  that  detect  a  portion  of  the  scene  will  give  an  estimate 
of  the  misfocus,  and  hence,  the  range.  The  trade-off  is  that  the  lateral  resolution  is  reduced.  A 
disadvantage  of  this  approach  is  that  if  there  are  two  objects,  or  an  object  with  parts  at  several 
ranges,  the  system  will  give  a  range  estimate  is  a  weighted  average  of  the  several  ranges. 

The  other  approach  is  to  modify  the  optics  so  that  there  is  a  peak  in  the  OTF  as  a  function  of 
misfocus  or  range.8  This  peak  moves  with  misfocus,  or  position  of  the  object.  In  this  case, 
several  range  “bins”  are  created  by  detecting  the  energy  that  lies  in  the  bins.  This  provides  a 
range  detection  technique  that  can  detect  several  objects  at  different  ranges  within  the  portion  of 
the  scene  that  corresponds  to  the  cluster  of  pixels  being  processed.  Again,  the  lateral  resolution 
is  compromised. 

New  optical/digital  design  tools  have  been  developed  that  are  a  combination  of  traditional 
ray  tracing  programs  to  model  the  optics  and  custom  software  to  model  the  interaction  between 
the  optics  and  digital  processing.  These  tools  allow  both  the  optics  and  digital  processing 
portions  of  the  imaging  system  to  be  designed  and  optimized  simultaneously. 


■"Imaging  Systems  Laboratory,  University  of  Colorado,  Boulder,  Colorado  80304-0425,  http://www.colorado.edu/isl 
+  CDM  Optics,  Inc.,  Suite  390, 4001  Discovery  Drive,  Boulder,  Colorado  80303-7816,  http://www.cdm-optics.com 
This  research  was  sponsored  in  part  by  the  U.S.  Army  Research  Office  under  grant  DAAG55-97-1-0348.  The 
information  herein  does  not  necessarily  reflect  the  position  of  the  federal  government. 
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Analog  to  digital  (A/D)  conversion  is  an  important  component  in  any  system,  which  must  sense  the  natural 
environment  and  process  that  information  using  digital  techniques.  Because  of  the  difficulty  in  achieving  both  high- 
resolution  and  high-speed  A/D  converters,  this  A/D  interface  has  been  and  continues  to  be  a  barrier  to  the  realization 
of  high-performance  systems.  Photonic  approaches  to  A/D  conversion  provide  many  advantages  including  high¬ 
speed  sampling  sources  with  extremely  low  clock  jitter,  ultra-fast  nonlinear  optical  device  switching,  decoupling  of 
the  sampled  and  sampling  signals  when  the  sampled  signal  is  electronic  and  the  sampling  signal  is  optical,  and  the 
inherent  parallelism  that  optics  provides.  Here  we  report  a  new  approach  to  photonic  A/D  conversion  using  a 
distributed  neural  network,  oversampling  techniques,  and  a  smart  pixel  hardware  implementation.  In  this  approach, 
the  input  signal  is  first  sampled  at  a  rate  higher  than  that  required  by  the  Nyquist  criterion  and  then  presented 
spatially  as  the  input  to  a  two-dimensional  error  diffusion  neural  network  consisting  of  M  xN  neurons,  each 
representing  a  pixel  in  the  image  space.  The  neural  network  processes  the  input  oversampled  analog  image  and 
produces  an  MxN  pixel  binary  or  halftoned  output  image.  By  design  of  the  neural  network,  this  halftoned  output 
image  is  an  optimum  representation  of  the  input  analog  signal.  Upon  convergence,  the  neural  network  minimizes  an 
energy  function  representing  the  frequency-weighted  squared  error 
between  the  input  analog  image  and  the  output  halftoned  image. 

Decimation  and  low-pass  filtering  techniques,  common  to  classic 
one-dimensional  oversampling  A/D  converters,  digitally  sum  and 
average  the  MxN  pixel  output  binary  image  using  high-speed 
digital  electronic  circuitry.  By  employing  a  two-dimensional  smart 
pixel  neural  approach  to  oversampling  A/D  conversion,  each  pixel 
constitutes  a  simple  oversampling  modulator  thereby  producing  a 
distributed  A/D  architecture,  shown  in  Figure  1.  Spectral  noise 
shaping  across  the  array  diffuses  quantization  error  thereby 
improving  overall  signal-to-noise  ratio  (SNR)  performance.  Here, 
each  quantizer  within  the  network  is  embedded  in  a  fully- 
connected,  distributed  mesh  feedback  loop  which  spectrally  shapes 
the  overall  quantization  noise  thereby  significantly  reducing  the 
effects  of  component  mismatch  typically  associated  with  parallel  or 
channelized  A/D  approaches.  The  two-dimensional  neural  array 
provides  higher  aggregate  bit  rates  which  can  extend  the  useful 
bandwidth  of  photonic-based,  oversampling  A/D  converters. 

The  performance  of  this  distributed  approach  to  A/D 
conversion  can  be  described  in  several  different  ways.  The  SNR  can  be  shown  to  be 

3  2  N  + 1  2  N  + 1 

SQNR  (M,N)= - Jzr-M2N  +  l,  (1) 

max  2  K 

where  M  is  the  oversampling  ratio  and  N  is  the  order  of  the  error  diffusion  filter.  This  equation  describes  the 
performance  of  an  Nth-order  oversampling  modulator,  part  of  an  oversampling  A/D  converter.  The  difference  here 
is  that  the  oversampling  ratio  is  proportional  to  the  output  image  size  and  therefore  can  be  larger  than  classic 
oversampling  ratios.  Component  mismatch  effect  tolerance  can  be  quantified  by  considering  the  error  diffusion 
neural  network  performance  under  these  conditions.  As  an  example,  the  effect  of  threshold  mismatch  across  the 
array  can  be  characterized  as  a  modification  to  the  neural  network  energy  function  according  to 

E  =  (y-x)r  A(y-x)  +  e/rA_1e/  -2ef  (y-x),  (2) 


Figure  1.  Fully-connected,  distributed 
oversampling  A/D  converter  architecture. 
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where  et  is  a  vector  containing  the  threshold  mismatch  of  the  array.  This  first  term  in  Eq.  (2)  is  the  same  as  in  the 
original  error  diffusion  neural  network.  The  last  two  terms  are  contributions  from  threshold  mismatch.  The  matrix  A 
is  derived  from  the  error  diffusion  filter  weights  and  has  high-pass  spectral  characteristics.  Eq.  (2)  demonstrates  that 
the  contribution  of  the  threshold  mismatch  to  performance  is  spectrally-shaped  by  the  error  diffusion  filter  thereby 
reducing  the  overall  impact. 

In  contrast  to  the  majority  of  current  smart  pixel  research,  the  architecture  reported  here  is  a  mixed  signal 
application  of  smart  pixel  technology.  The  optoelectronic  circuitry  for  a  proof-of-concept  5  x5  error  diffusion  neural 
network  was  implemented  in  0.5  pm  silicon  CMOS  technology  requiring  the  functionality  of  one-bit  quantization, 
subtraction,  neuron-to-neuron  weighting  and  interconnection,  and  optical  input  and  output.  Figure  2  shows  the 
circuitry  associated  with  a  single  neuron  of  this  error  diffusion  neural  network  implemented  using  CMOS-SEED 
smart  pixel  technology.  Here,  the  one-bit  quantizer  is  implemented  using  a  wide-range  transconductance  amplifier 
operated  in  the  subthreshold  regime.  The  slope  of  this  sigmoidal  function  was  carefully  designed  by  matching 
MOSFET  transistors  to  meet  the  -j- 

convergence  criteria  and  the  nonlinear  » lr|  H.',‘  Hi:  l^’.v 

dynamics  of  the  error  diffusion  neural  ^  ^  Er  „*  Er  j  |t“." 

network.  All  state  variables  in  this  -  ~*~  r  - i  jfph”  T- 

circuit  are  represented  as  currents  and  •;>  jTLp  »■  jfl  |t  ■»  «•  s  |  jjl _ js**, 

therefore  the  subtraction  functionality  A4  i  i  ^  ^  l  l  l  . . . 

was  implemented  using  current 

summation  techniques  at  specified  - 

nodes.  The  error  weighting  and  - 

distribution  circuitry  for  the  5  x5  array  sj-p-j s  .’.t.  J.1.. sjjjr »'.«  [3  |s — |s 

was  implemented  in  silicon  circuitry  by 

designing  and  matching  the  width-to- 

length  ratios  of  both  the  individual  and  ip-js  V',  “V  ip-J=  ==  4=  T-  -fs  ^  4s  “J” 

stage-to-stage  MOSFET  transistors.  a;-.  j  J  t\  j 

Bi-directional  error  currents  were  &sjj|3£ 

implemented  to  provide  the  circuitry  _  .  . -*• 

and  the  network  with  fully-symmetric 

performance.  Optical  input  and  output  :trip-|s*.'S 

to  each  neuron  was  implemented  using 

SEED  multiple  quantum  well  (MQW)  FlZure  2-  Circuit  dia8ram  °f a  sin8le  neuron  and  a  sinSle  error  weiSht  °f a 

modulators.  The  input  MQW  is  5x5  CMOS-SEED  error  diffusion  neural  network  smart  pixel  architecture. 

reverse-biased  and  functions  as  an  optical  detector  while  the  output  optical  binary  signal  is  produced  using  a  forward 
biased  MQW,  which  provides  photoemission  through  electroluminescence. 

The  central  neuron  of  the  smart  pixel  array  consists  of  approximately  160  transistors  while  the  complete  5  x5 
array  accounts  for  over  3600  transistors.  The  central  neuron  is  interconnected  to  the  surrounding  24  neurons  in  the 
5x5  array  using  a  fixed  interconnect  and  weighting  scheme.  SPICE  simulations  of 
each  of  the  functional  components  as  well  as  the  complete  5  x5  nonlinear 
dynamical  neural  network  were  performed  using  transistor  parameters  extracted 
from  a  previous  0.5  (im  MOSIS  foundry  run  and  have  verified  both  functional 
operation  as  well  as  overall  network  performance.  Individual  component 
functionality  was  experimentally  characterized  and  dynamic  operation  of  the  full 
5x5  neural  array  was  also  experimentally  characterized.  Figure  3  shows  a  CCD 
image  of  the  operational  array  under  50%  gray-scale  condition.  Dynamic 
stimulus  of  single  and  multiple  neurons  was  also  conducted  and  demonstrated 
correct  error  diffusion  and  network  operation. 


Figure  2.  Circuit  diagram  of  a  single  neuron  and  a  single  error  weight  of  a 
5x5  CMOS-SEED  error  diffusion  neural  network  smart  pixel  architecture. 


Figure  3.  CCD  image  of  5x5  smart 
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Abstract 

We  proposed  a  general  purpose,  smart  pixel  based  photonic 
information  processing  architecture  comprised  of  RISC 
processing  pixels  and  a  reconfigurable  micro  instruction  store. 
Implemented  with  a  photonic  VLSI  technology  the  device  is 
suitable  for  prototyping  photonic  information  processing 
systems. 

Summary 

In  the  past  decade,  the  development  of  smart-pixel  device 
technologies  has  fueled  the  promise  for  information 
processing  systems  that  would  merge  the  areas  of  optical 
imaging  and  information  processing^  1]  As  illustrated  by  the 
development  of  data  processing  systems,  [2,3]  application  of 
smart  pixel  technology  has  gone  beyond  the  design  of  smart 
image  processing  systems.  Currently,  the  trend  is  towards  the 
development  of  photonic  VLSI  technologies  that  combine  the 
massive  parallelism  and  high  data  transfer  rates  associated 
with  photonic  device  and  the  high  density,  reliable  and  cost 
effective  information  processing  capabilities  associated  with 
CMOS  logic  circuitry.  [4] 

While  photonic  VLSI  chips  enable  unprecedented 
functionality,  designing  devices  with  integrated  optical  and 
electronic  components  is  not  trivial.  Beyond  providing 
functionality,  optimization  of  chip  performance  requires 
consideration  of  several  factors  including:  information  loss 
associated  with  the  conversion  between  the  two  forms  of 
energy,  the  disparity  between  optical  and  electronic  switching 
speeds,  and  the  size  mismatch  between  optoelectronic  devices 
and  CMOS  transistors.  Often  individual  pixel  space  is 
constrained  by  the  need  to  fit  a  large  array  of  pixels  onto  a 
single  chip. 

For  these  reasons  and  others,  smart-pixel  systems  based  on 
photonic  VLSI  devices  are  typically  implemented  as  custom 
designed  ASIC’s  that  are  well  optimized  for  a  target 
application.  This  approach  can  produce  component  designs 
that  offer  superior  performance  and  efficiency  over 
conventional  processors  and  software.  However,  the  design 
and  implementation  of  these  special  purpose  chips  can  also  be 
very  time  consuming  and  costly.  Thus,  custom  designed 
photonic  VLSI  systems  are  not  well  suited  for  photonic  test¬ 
bed  systems  where  information  processing  algorithms  and 
system  architectures  can  be  refined  through  the  use  of  a  rapid 
prototyping  research  and  experimentation  environment. 

To  overcome  these  difficulties,  we  have  proposed  a  generic 
information  processing  architecture  based  on  smart  pixel 


concepts  implemented  in  a  photonic  VLSI  device  technology. 
We  will  refer  to  this  architecture  as  CASPR  (Configurable 
Architecture  for  Smart  Pixel  Research). 

Conceptually,  the  CASPR  architecture  is  comprised  of  an 
array  of  small,  optically  accessible,  reduced  instruction  set 
computer  (RISC),  pixels  that  are  controlled  through  a  user 
programmable  instruction  set  that  is  stored  on  the  CASPR 
chip.  Thus,  the  CASPR  architecture  provides  a  generically 
programmable  smart  pixel  environment  that  can  be  configured 
for  a  variety  of  photonic  information  processing  applications. 
Further,  the  CASPR  design  is  simple  enough  that  a  large  pixel 
array  can  fit  on  a  single  CMOS  chip. 

As  shown  in  Figure  1,  each  CASPR  processing  cell  contains  a 
photodetector  for  receiving  data,  some  storage  in  the  form  of 
registers  or  RAM,  a  simple  ALU  and  a  network  interface  to 
facilitate  communications  outside  of  the  pixel.  Each  smart 
pixel  element  can  perform  computations  on  the  data  in  its 
internal  storage,  the  data  received  from  other  pixels  and/or  the 
optical  data  sampled  through  the  photoreceiver.  The  network 
interface  allows  pixels  to  communicate  and  pass  data  in  a 
number  of  ways,  in  addition  to  allowing  a  host  computer  to 
issue  instructions  to  designated  cells.  Thus,  each  cell  (or 
pixel)  can  receive  data  optically  and  process  that  data  with  the 
help  of  the  other  cells  in  the  array. 

The  processing  elements  are  controlled  in  a  partitionable- 
SIMD  fashion  that  dose  not  restrict  all  cells  to  execution  of 
the  same  instruction.  Rather,  we  relax  that  restriction  of 
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Figure  1  Block  Diagram  of  a  CASPR  smart  pixel.  These 
devices  can  be  tiled  to  form  and  array  of  generically 
programmable  smart  pixel  based  photonic  information 
processing  system. 
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conventional  SIMD  architectures^]  by  allowing  instructions 
from  the  host  computer  to  be  directed  at  specific  cells  or 
collections  of  cells  based  on  a  simple  addressing  scheme. 
Sending  commands  to  the  entire  array  can  perform  vector 
operations,  while  sending  commands  to  selected  cells  (via  an 
address  and  a  mask)  allows  for  different  control  paths 
between  cells.  Predication  allows  any  given  cell  to  ignore 
certain  instructions  based  on  its  current  state.  Predication  of 
instructions  provides  a  simple  form  of  conditional  execution, 
without  the  need  for  multiple  instruction  streams. 

To  further  improve  flexibility,  the  instruction  set  visible  to  the 
host  computer  is  not  defined  by  the  architecture  [6],  A 
programmable  control  store  (implemented  as  a  cache)  is 
located  on  each  CASPR  chip  to  map  incoming  instructions  to 
control  signals  (an  expanded  microinstruction  format)  before 
broadcasting  them  to  all  the  cells  within.  The  contents  of  the 
control  store  can  be  configured  such  that  the  user  can  define 
and  optimize  an  instruction  set  for  the  application  of  interest. 
This  allows  the  encoding  of  instructions  such  that  the  CASPR 
chip  can  serve  as  a  real-time  prototyping  device  while  using 
the  same  interface  as  a  more  specialized  device.  Further,  the 
instruction  set  can  be  reconfigured  during  execution  time  to 
add  specialized  instructions  as  needed  [7]. 

The  advantages  of  the  proposed  architecture  are  generality, 
and  flexibility.  Devices  that  implement  this  architecture  are 
not  optimized  for  a  single  application,  rather  a  set  of 
applications  ranging  from  optical  computing  to  optical  storage 
and  communication.  Specifically,  any  task  that  involves 
receiving  pages  of  digital  data  from  an  optical  source, 
processing  the  data  in  parallel,  and  returning  results  to  a  host 
computer.  By  means  of  the  host  computer,  the  user  can 
define  an  instruction  set,  and  program  the  device  for  the 
desired  task.  Since  it  is  possible  to  construct  both  optical 
detectors  and  logic  circuits  on  one  monolithic  chip  in  a 
commercial  CMOS  process  [2],  CASPR  devices  can  be 
developed  for  a  minimal  cost.  Researchers  can  then  use  this 
device  to  rapidly  prototype  and  test  other  designs  (and  the 
systems  that  incorporate  them)  without  the  effort  and  expense 
of  developing  an  ASIC.  The  device  can  be  reused  as  needed, 
and  serve  as  a  platform  for  prototyping  a  variety  of  smart- 
pixel  devices. 

Figure  2  illustrates  a  possible  photonic  system  test-bed 
environment  suitable  for  prototyping  smart  pixel  based 
photonic  information  processing  systems.  In  this 
configuration  optical  data  is  passed  to  the  CASPR  chip  from 
either  a  page-oriented  optical  storage  device  or  a  spatial  light 
modulator  (SLM)  that  is  controlled  by  a  host  computer.  The 
CASPR  chip  processes  the  optical  data  along  with  data  held  in 
each  pixel  and  produces  electrical  output  signals  that  are 
passed  into  the  electrical  host  computer.  The  host  computer 
uses  the  electrical  output  signals  from  the  CASPR  chip  to 
drive  the  SLM  producing  a  new  set  of  optical  input  data  for 
the  CASPR  chip.  Thus,  figure  2  represents  a  recirculating 
photonic  information  processing  architecture  that  would  allow 
for  the  evaluation  of  a  variety  of  data  processing  applications 
including  data  sorting,  data  filtering,  and  matrix  manipulation 
systems.  Finally,  in  a  more  generic  implementation,  the 
straight  pass  optical  interconnect  between  the  SLM  and  the 
CASPR  chip  could  be  replace  with  a  more  complex 
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Figure  2  One  possible  prototyping  environment  suitable  for 
evaluation  of  photonic  information  processing  systems  based 
on  the  CASPR  chip. 

interconnection  network  allowing  for  implementation/testing 
of  more  sophisticated  photonic  information  processing 
systems. 

In  conclusion  we  have  proposed  a  reconfigurable  photonic 
information  processing  chip  based  on  a  photonic  VLSI  device 
technology.  A  test  chip  is  currently  being  fabricated  through 
the  MOSIS  CMOS  foundry  service.  Details  of  the  pixel 
design  and  operational  results  will  be  presented  in  the  near 
future. 
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Digital  image  halftoning,  often  referred  to  as  spatial  dithering,  is  the  transformation  of  a  continuous-tone 
input  image  into  a  binary-valued  output  image.  Digital  halftoning  is  particularly  important  to  applications 
such  as  laser  printing,  bilevel  displays,  xerography,  and  facsimile.  Error  diffusion  is  an  important  class 
of  digital  halftoning  transformations  in  which  the  error  associated  with  a  single  pixel  is  diffused  within  a 
local  region  for  the  purpose  of  improving  overall  halftoning  quality.  Classical  error  diffusion  algorithms 
are  unidirectional  and  consequently  introduce  visually  objectionable  artifacts  in  the  halftoned  images.  Re¬ 
cently,  a  fully  symmetric  approach  to  error  diffusion  based  on  a  neural  network  has  been  investigated  and 
implemented.1  This  new  approach  to  halftoning  has  demonstrated  improved  halftone  quality  over  other 
existing  algorithms.  The  limitation  in  this  approach  is  the  speed  of  convergence  of  the  neural  network  for 
reasonably-sized  images.  As  a  result,  hardware  architectures  of  this  error  diffusion  network  (EDN)  are  being 
investigated  which  provide  sufficient  switching  speed  and  connectivity  to  generate  real-time  halftone  images. 

Smart  pixel  technology  provides  a  promising  technological  alternative  for  the  implementation  of  the 
EDN  because  optical  input,  electronic  processing,  and  optical  output  are  integrated  in  a  single  array.  While 
current  smart  pixel  technology  could  support  256  x  256  array  sizes,  it  is  of  interest  to  investigate  partitioning 
approaches  which  use  smaller  physical  array  sizes  to  achieve  the  same  functionality  and  performance  as  larger 
arrays.  One  approach  to  this  partitioning  is  to  divide  a  large  image  into  smaller  sub-images,  multiplex  these 
sub-images  into  a  small  smart  pixel  EDN,  and  then  demultiplex  the  partitions  into  the  resulting  full-sized 
image.  This  concept  is  shown  graphically  in  Figure  1. 

There  are  a  number  of  conceivable 
approaches  to  the  partitioning  of  the 
image.  Simple  tiling,  in  which  the  in¬ 
put  image  is  segmented  into  N  sub¬ 
images,  each  being  processed  indepen¬ 
dently  of  the  other,  produces  undesir¬ 
able  artifacts  in  the  resulting  halftones. 

Figure  2(a)  demonstrates  this  result. 

Like  other  neural  networks,  the  EDN 
requires  interconnections  to  neighbor¬ 
ing  pixels  to  accomplish  the  halftoning. 

Since  there  is  no  tile-to-tile  communi¬ 
cation,  the  error  diffusion  region  is  restricted  and  the  diffusion  to  neighboring  tiles  is  lost.  The  result  is  the 
visually  objectionable  horizontal  and  vertical  banding  seen  in  Figure  2(a). 

In  research  reported  elsewhere,  several  alternatives  to  simple  tiling  were  investigated.2  The  method  that 
was  found  to  be  most  successful  was  to  constrain  the  pixels  along  the  outside  of  a  tile  as  it  was  being  halftoned. 
The  values  used  as  constraints  were  the  values  of  the  pixels  that  resulted  from  halftoning  neighboring  tiles. 
For  example,  once  the  tile  in  the  upper-left  corner  of  an  image  has  been  halftoned,  the  tile  immediately  to 
the  right  was  selected.  Instead  of  constructing  a  tile  of  the  same  dimensions,  the  horizontal  dimension  was 
increased  to  include  a  band  of  pixels  from  the  previously  halftoned  tile  to  the  left.  That  band  was  fixed,  or 
not  allowed  to  change  as  the  EDN  halftoning  of  the  second  tile  proceeded.  This  tied  the  result  of  halftoning 
the  second  tile  to  the  the  completed  halftone  of  the  first  tile.  Except  for  the  constrained  band,  the  smart 
pixels  of  the  EDN  were  allowed  to  converge  normally.  Details  of  this  technique’s  performance  remain  to  be 
studied,  but  the  initial  results  showed  visually  pleasing  output  from  several  different  classes  of  input  imagery. 
Figure  2(b)  shows  a  result  of  this  constrained  tiling  approach. 

This  work  supported  in  part  by  the  Army  Research  Office,  DARPA,  and  the  Topographic  Engineering  Center. 
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Figure  1.  Partitioning  approach  to  error  diffusion  halftoning  using 
a  smart  pixel  array. 
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(a)  Simple  tiled  halftone  (b)  Constrained  tiled  halftone 

Figure  2.  Halftones  produced  by  tiling. 


This  EDN  approach  to  halftoning  is  an  extension 
of  work  originally  reported  by  Anastassiou.3  He 
first  defined  a  frequency-weighted  mean  square  error 
distortion  measure  based  on  the  frequency  response 
of  the  human  visual  system  and  then  showed  that 
minimizing  this  metric  was  analogous  to  minimiz¬ 
ing  the  energy  function  in  a  classical  Hopfield-type 
neural  network.  In  equilibrium,  the  error  diffusion 
neural  network  satisfies  u  =  W(y  -  u)  +  x.  For  an 
nxm  image,  W  is  an  n2  x  m2  sparse,  nearly  circu- 
lant  matrix  derived  from  the  original  q  x  q  set  of  error  diffusion  weights  whj.  An  equivalence  to  the  Hopfield 
network  can  be  described  by  u  =  A(Wy  +  x),  where  A  =  (I  +  W)-1.  Effectively,  the  error  diffusion  net¬ 
work  includes  a  pre-filtering  of  the  input  image  x  by  the  matrix  A  while  still  filtering  the  output  image  y  but 
now  with  a  new  matrix,  AW.  Consequently  the  EDN  is  equivalent  to  a  certain  class  of  Hopfield  networks, 
as  shown  in  Figure  3.  The  partitioning  of  large  neural  networks  described  here  is  potentially  applicable  to 
other  applications  of  optoelectronic  neural  networks,  notably  signal  processing4  and  pattern  recognition.5 


Figure  3.  Equivalence  of  EDN(b)  to  Hopfield  net(a). 
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Abstract 

A  database  filter  suitable  for  accessing  optical  storage  devices  is  presented.  This  photonic  VLSI  chip  includes  a 
smart  pixel  array  that  selects  words  matching  a  search  criterion  and  control  circuitry  that  interfaces  the  filter  chip 
with  an  electronic  host  computer. 

Summary 

The  commercialization  of  CD  ROM  drives  has  clearly  demonstrated  the  ability  of  optical  storage  devices  to  meet  the 
current  demand  for  archival  data  storage.  However,  with  the  continued  expansion  of  electronic  infoimation 
resources,  storage  capacity  requirements  are  expected  to  approach  the  terabit  level  for  personal  users  and  exceed  the 
petabit  level  for  database  and  data  warehouse  systems.[l]  Further,  many  data  intensive  applications  will  continue  to 
require  real  time  data  access  rates.  Thus,  system  designers  for  the  next  generation  of  archival  storage  system  have 
the  challenging  task  of  providing  storage  capacities  several  orders  of  magnitude  larger  than  existing  systems  while 
maintaining  current  data  access  times. 

While  considerable  effort  has  been  put  toward  the  development  of  optical  storage  media  capable  of  providing  high 
capacity,  volumetric  data  storage,  relatively  little  work  has  been  done  to  explore  the  technological  barriers 
associated  with  providing  page-oriented  access  to  these  memory  devices.  In  particular,  issues  related  to  the 
bandwidth  mismatch  between  a  page-oriented  optical  memory  and  word-oriented  electronic  computer  have  not  been 
addressed.  More  specifically,  high  capacity,  high  data  rate,  page-oriented  optical  storage  devices  do  not  address  the 
bottle  neck  associated  with  getting  data  into  the  microprocessor  of  database  management  computer. 
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Figure  1  Block  diagram  of  a  database  filter  in  the  hierarchy  of  a  memory  access  sub-system. 

The  concept  of  database  filter  has  been  suggested  as  a  solution  to  this  problem  in  database  management  systems. [2] 
As  shown  in  Figure  1,  a  database  filter  is  an  optical  memory  interface  device  that  accepts  large  bandwidth  data  from 
the  page-oriented  optical  storage  device,  and  produces  word-oriented  output  at  a  data  rate  compatible  with  the 
electronic  host  computer.  To  accommodate  the  data  rate  mismatch  between  the  storage  device  and  the  electronic 
host  computer,  the  database  filter  preprocesses  the  optical  input  data  by  searching  for  words  that  match  a  user 
defined  search  criteria.  Thus,  the  filter  can  pass  only  valid  data  along  the  low  bandwidth  connection  to  the 
electronic  host. 

Over  the  last  few  years,  we  have  been  developing  smart  read-head  devices  for  large  capacity,  page-oriented  optical 
storage  systems.  [3 ,4]  Here  we  present  the  photonic  VLSI  design  and  initial  circuit  evaluations  for  our  latest  data 
filter  design.  Intended  to  function  as  a  smart  optical  memory  read  head  device,  the  database  filter  we  present  here 
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Figure  2  Block  digram  of  the  optoelectronic  database  filter.  The  central  grid  area  is  a  smart  pixel  array 
that  accepts  optical  page-oriented  data  and  executes  the  search  operation.  Valid  data  is  held  in 
the  smart  pixel  array  until  access  is  granted  to  the  external  bus. 

will  extend  the  capabilities  of  conventional  read  head  technology  by  including  on-the-fly  relational  database 
processing  integrated  into  the  read  head  of  a  page-oriented  optical  memory  system. 

A  conceptual  view  of  the  latest  database  filter  design  is  shown  in  Figure  2.  The  Optical  Data  Input  Smart  Pixel 
Array  reads  optical  data  from  a  page-oriented  optic  storage  device.  Data  is  organized  in  this  array  with  each  column 
representing  an  individual  word  in  the  memory.  Control  signals  provided  by  Bit  Selection  Controller  allow  each 
pixel  to  evaluate  if  its  data  bit  matches  the  data  selection  criteria.  The  results  from  individual  bit  comparisons  are 
summed  along  each  word.  If  the  result  of  the  bit  summation  indicates  that  a  word  matches  the  selection  criteria  then 
the  Data  Queue  Controller  loads  the  word  into  an  asynchronous  data  queue  located  in  the  smart  pixel  array.  A  Bus 
Arbitration  Controller  is  used  to  control  data  flow  between  the  individual  word  queues  and  the  Post  Processing 
Control  Circuit.  The  Post  Processing  Control  Circuit  performs  word  based  data  manipulation  operations  including 
data  projection  and  data  formatting. 

Based  on  a  photonic  VLSI  device  technology,  our  database  filter  monolithically  integrates  all  necessary 
optoelectronic  and  logic  components  into  a  single  CMOS  chip  that  can  be  readily  fabricated  using  standard  VLSI 
fabrication  facility.  Thus,  our  data  filter  is,  compact,  readily  integrated  into  optical  storage  read  head  designs, 
compatible  with  existing  electronic  device  manufacturing  technology  and  shares  all  of  the  reliability,  uniformity, 
and  manufacturability  benefits  associated  with  current  electronic  hardware. 

In  this  paper  we  will  present  an  evaluation  of  our  latest  smart  pixel  circuits  and  detail  the  performance  of  individual 
components  of  the  database  filter.  Finally,  we  present  our  expectations  for  a  16x16  bit  database  filter  currently 
under  development. 
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MB1.1 

8:30am-9:00am 


An  IR  tunable  diode  laser  absorption  spectrometer  for  trace  humidity  measurements  at 

atmospheric  pressure 

CS  Edwards(1),  GP  Barwood(1),  P  Gill(1),  M  Stevens(1),  B  Schirmer(2),  R  Benyon(3)  and 
P  Mackrodt(4) 

(1) National  Physical  Laboratory,  Queens  Road,  Teddington,  Middx  TW11  OLW 

(2) University  of  Erlangen-Niimberg,  Cauerstrasse  4,  D-91058  Erlangen,  Germany 

{3)Instituto  Nacional  de  Tecnica  Aerospacial,  Ctra.  a  Ajalvir,  km4,  E-28850,  Torrejon  de  Ardoz 
Spain 

(4)Physikalisch-Technische  Bundesanstalt,  Bundesallee  100,  D-38116  Braunschweig,  Germany 


Introduction 


There  is  an  increasing  demand  for  the  measurement  of  trace  humidity  in  air  at  levels  down  to  a 
few  parts  in  109  (ppbv)  at  atmospheric  pressure.  This  summary  describes  the  development  by  a  6- 
member  European  collaboration  of  a  diode  laser  absorption  spectrometer  capable  of  measuring 
trace  humidity  with  an  uncertainty  of  5  ppb.  The  diode  laser  was  a  distributed  feedback  device, 
tuned  to  the  3,0,3  <-  2,0,2  rotation  transition  in  the  Vi  +  V3  vibrational  band  of  water  vapour  at 
1393  nm[l]. 


Three  different  spectroscopic  techniques  were  investigated,  namely  wavelength  modulation 
spectroscopy,  single  frequency  modulation  spectroscopy  and  two-tone  frequency  modulation 
spectroscopy  (TTFMS).  Of  these  methods,  TTFMS  combines  the  advantages  of  being  the  most 
sensitive  technique  and  is  relatively  straightforward  to  implement.  After  a  number  of  refinements 
to  the  optical  layout,  TTFMS  features  were  obtained  with  no  observable  background  interference 
effects.  Figure  1  shows  a  scan  with  a  signal  to  noise  ratio  on  the  strongest  feature  of  5600.  This 
scan  was  obtained  in  0.4  m  of  laboratory  air  at  ambient  pressure  and  humidity  (=50%  relative 
humidity,  equivalent  to  a  concentration  of  1.2  x  104  ppm). 


Frequency  detuning  from  line  at  1392.53  nm  (GHz) 

Figure  I;  TTFMS  spectrum  obtained  in  0.4  m  of 
laboratory  air  at  ambient  humidity. 


In  order  to  provide  a  facility  for  trace 
humidity  measurement,  a  multi-pass  White 
cell  was  manufactured.  The  spectrometer 
design  is  shown  in  figure  2,  and  both  free 
space  and  fibre  pig-tailed  versions  of  the 
input  optics  were  built.  The  mirror  separation 
in  the  cell  was  0.25  m,  and  a  10-m  optical 
path  length  was  obtained  by  passing  the  beam 
40  times  through  the  cell.  The  peak  TTFMS 
signal  size  varies  linearly  over  a  wide  range  of 
water  vapour  concentrations.  This  allows 
verification  of  the  spectrometer  in  the  region 
above  1  ppm  where  certified  humidity 
measurement  is  possible.  The  TTFMS  signal 
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size  may  then  be  extrapolated 
down.  The  spectrometer  has 
dbm,  currently  undergone  tests  at 

the  four  humidity  standard 
laboratories  of  the  partners  in 
the  consortium  [2], 

The  diode  laser  is  computer- 

controlled  via  an  IEEE 

interface  and  current-scanned 

over  the  absorption  feature  to 

determine  the  peak  TTFMS 

signal.  The  TTMFS  results 

were  read  by  the  computer  via 

an  analogue  to  digital  input 

card.  Multiple  scans  over  the 

line  were  used  and  a  curve 

Figure  2:  Diode  laser  spectrometer  and  multi-pass  White  cell.  The  diode  fitting  routine  implemented  in 

laser  driver  was  computer  controlled.  ,  .  ,, 

F  order  to  improve  on  the 

sensitivity  and  reproducibility.  The  software  controls  a  voltage  output  from  the  PC  which  varies 

linearly  with  the  water  vapour  concentration,  with  a  reproducibility  of  5  ppb.  The  concentration 

is  also  displayed  on  the  screen  and  the  data  recorded  as  a  function  of  time  on  disc. 

1 00  ppb  moisture  concentration 

0.120-j 


Acknowledgement:  The  authors  acknowledge 
the  support  of  the  European  Union  under 
contract  SMT4-CT96-2125. 
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Figure  3:  TTFMS  signal  from  the  ASTHMEG 
spectrometer  at  a  100  ppb  water  vapour  concentration 
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9:00am-9:30am 


A  Trace  Gas  Measurement  Device  Using  Cavity  Ring-Down  Spectroscopy 

Wen-Bin  Yan,  Yei  Wo,  MEECO,  Inc.,  250  Titus  Avenue,  Warrington,  PA  18976 
John  Dudek,  Paul  Rabinowitz,  Kevin  Lehmann,  Dept,  of  Chemistry,  Princeton  University, 
Princeton,  NJ  08544 


Summary 

Since  1995,  MEECO,  Inc.,  in  collaboration  with  Professor  Lehmann  of  Princeton 
University,  has  been  developing  Cavity  Ring-Down  Spectroscopy  (CRDS)  for  trace  gas 
measurement.  This  work  provided  the  first  demonstration  that  CRDS  could  be  used  with 
highly  attractive  continuous  wave  (CW),  semiconductor  diode  lasers.  As  a  result, 
Princeton  University  was  awarded  a  patent  that  covers  the  use  of  any  cw  laser  excitation 
in  CRDS.  CW,  single-mode  semiconductor  diode  lasers  make  CRDS  commercially 
viable,  thanks  to  their  small  size,  low  power  requirements,  long  lifetime,  monolithic 
construction,  and  modest  cost. 

MEECO’s  first  CRDS-based  commercial  product,  MTO-IOOO  will  be  introduced  mid- 
2000.  The  initial  target  species  for  measurement  is  moisture,  an  important  contaminant 
in  high-purity  process  gases  used  in  the  semiconductor  industry.  A  critical  factor  in 
achieving  acceptable  product  yields  with  ever-smaller  geometry  chips  is  maintenance  of 
ultra-low  contamination  in  semiconductor  process  gases.  It  has  been  established,  for 
instance,  that  moisture  contamination  levels  of  even  a  few  parts  per  billion  by  volume 
(ppb)  can  result  in  defects  on  silicon  wafers.  To  offset  the  damages  caused  by  the 
presence  of  moisture  in  process  gas  lines,  manufacturers  call  on  gas  suppliers  to 
guarantee  moisture  in  semiconductor-grade  gases  at  levels  no  greater  than  a  few  ppb. 
Very  few  convenient  analytical  methods  are  presently  available  to  measure  residual  water 
vapor  at  such  levels. 

The  basic  operating  principle  of  MTO-1000  is  based  on  absorption  spectroscopy,  which 
provides  the  most  general  spectroscopic  method  for  detection  of  important  trace 
species.  It  is  an  absolute  measurement  method  requiring  no  calibration.  In  the  gas 
phase,  the  spectrum  of  a  molecular  species  consists  of  many  sharp  rotational  lines  that 
provide  high  selectivity  (a  molecular  ‘fingerprint’).  In  a  conventional  absorption 
measurement,  the  intensity  of  light  transmitted  through  a  sample  is  compared  to  the 
transmission  without  the  sample.  The  sensitivity  and  accuracy  is  often  limited  by  the 
amplitude  noise  of  light  sources  and  the  detection  system.  CRDS,  on  the  other  hand,  is 
not  affected  by  the  laser  noise  because  it  only  measures  ring-down  time.  CRDS  also 
dramatically  increases  the  sensitivity  of  absorption  spectroscopy  because  its  effective 
absorption  pathlength  can  be  as  long  as  105  of  the  physical  length  of  the  cell. 

The  schematic  of  MTO-1000  is  shown  in  Figure  1 .  With  a  one  meter  cavity  and  a  1 .39 
pm  diode  laser,  we  have  measured  ring-down  time  as  long  as  300  ps,  which 
corresponds  to  a  pathlegth  of  90  km.  This  device  is  capable  of  measuring  500  ppt  of 
moisture  accurately  in  ultra-pure  gases. 
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Through  its  new  Tiger  Optics  subsidiary,  MEECO  is  also  developing  this  technology  to 
measure  a  wide  range  of  critical  molecules,  such  as  CO,  NH3,  HF,  HCI,  CH4.  Future 
capabilities  will  be  discussed  in  detail  at  the  meeting. 


6 


MB1.3 

9:30am-1 0:00am 


Application  of  Diode  Laser  Spectroscopy  for  Moisture  Contamination 
Measurement  in  Semiconductor  Processing 

James  McAndrew,  Ronald  Inman  and  Dmitry  Znamnensky 
Air  Liquide 
5230  S.  East  Ave. 

Countryside,  IL  60525 
U.S.A. 


Introduction 

High  purity  of  process  atmospheres  is  usually  considered  critical  to  high  yields  in  semiconductor 
processing.  As  device  dimensions  become  smaller,  purity  demands  increase,  as  manifested  in  the  ppt 
(part-per-trillion)  contamination  levels  now  routinely  required  of  carrier  gases.  In  recent  years,  however, 
it  has  become  clear  that  other  sources  of  contamination,  such  as  reactive  gases,  the  process  chamber  and 
semiconductor  wafers  themselves,  are  more  important.  Tunable  diode  laser  absorption  spectroscopy 
(TDLAS)  is  extremely  useful  for  such  environments:  As  a  spectroscopic  technique  that  exposes  only 
optical  elements  to  the  analyte,  it  is  compatible  with  most  matrices.  In  this  work,  we  review  some  recent 
applications  of  this  technique,  both  in  the  laboratory  and  in  the  wafer  fab. 


TDLAS  has  been  applied  to  trace  contamination  measurement  in  high  purity  reactive  gases  -  improving, 

for  example,  the  achievable  detection  limit  for 
H20  in  NH3  from  about  2  ppm  (by  FT1R)  to  50 
ppb  -  and  to  in  situ  monitoring  in  semiconductor 
process  tools.  In  situ  moisture  monitoring 
enables  improvements  in  overall  equipment 
effectiveness  (OEE)  by  reducing  time  spent  on 
purging  procedures.  Moisture  is  an  excellent 
indicator  of  atmospheric  contamination,  being 
perhaps  the  most  difficult  atmospheric  impurity 
to  remove.  By  measuring  true  contamination 
levels,  rather  than  simply  relying  on  pre-set 

0  1  2  3  4  5  6  7  timing,  significant  savings  are  achievable.  In 

Generated  Concentration  (ppm)  addition,  non-product  wafers  for  qualification 

and/or  trouble-shooting  can  be  reduced  if  the  true 
Figure  1:  Calibration  verification  after  15  system  status  is  known, 

months  on  process  tool 


E  1 

3  5-1- 

D) 

■-  4 

"O 

8.  3 

8  2 
c 

(U  1 
CO 


...  -y  =  * 


Air  Liquide’s  TDLAS  technology  for  moisture 

measurement  in  semiconductor  gases  and  process  atmospheres  is  now  available  commercially  from 
SOPRA  (Bois-Colombes,  France),  a  manufacturer  of  optical  instrumentation  for  semiconductor 
manufacturing  and  related  industries. 


TDLAS 

Air  Liquide  selected  TDLAS  for  development  because  it  is  a  relatively  well-established  and  robust 
technology  already  widely  applied,  not  only  in  the  laboratory,  but  also  in  numerous  demanding  field 
applications.  It  has  been  shown  to  be  capable  of  delivering  the  ppb  (part-per-billion)  sensitivity  levels 
required  for  semiconductor  process  gas  monitoring.  TDLAS  does  not  demand  extremely  high  mirror 
reflectivities  as  do  intracavity  laser  and  cavity  ring-down  spectroscopies,  and  is  therefore  less  likely  to 
display  performance  loss  over  time.  To  date  we  have  focussed  on  the  application  of  TDLAS  to  H20 
monitoring,  but  its  application  to  other  species  is  straightforward  once  a  diode  emitting  at  the  appropriate 
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wavelength  is  available.  As  TDLAS  becomes  widely  adopted,  diodes  that  emit  at  wavelengths 
characteristic  of  most  molecules  of  interest  are  expected  to  become  available. 

Because  TDLAS  is  based  upon  absorption  spectroscopy,  the  measurement  is  absolute  and  can  be  directly 
related  to  fundamental  properties  of  the  molecule  of  interest.  In  practice,  it  is  often  more  convenient  to 
quantify  the  response  of  the  signal-processing  electronics  by  direct  comparison  with  a  known  moisture 
standard.  The  reliability  of  this  procedure  is  illustrated  in  Figure  1 .  The  sensor  reading  was  verified  after 
15  months  on  a  semiconductor  process  tool  and  found  to  be  still  in  calibration  within  the  accuracy  of  the 
moisture  generator  used.  This  chart  is  based  upon  a  test  such  as  that  illustrated  in  Figure  2,  in  which 
various  moisture  levels  are  generated  by  varying  the  flow  of  pure  gas  over  a  permeation  device.  In  this 
figure,  the  rapid  response  of  the  hygrometer  is  clearly  illustrated. 


Figure  2:Response  to  varying  moisture  input 


Applications 

The  application  of  TDLAS  to  a  variety  of 
semiconductor  process  environments,  including 
RTP,  CVD  and  etch,  will  be  reviewed1,2. 

TDLAS  has  been  demonstrated  to  be  compatible 
with  aggressive  gases  such  as  HC1,  NH3, 

SiH2Cl2,  etc.  and  with  strongly  depositing 
atmospheres  as  in  Si3N4  LPCVD3.  In  addition4,  it 
has  been  shown  to  be  capable  of  trace  moisture 
measurements  to  50  ppb  in  pure  NH3  and  to  10 
ppb  in  a  variety  of  corrosive  gases  including 
HC1.  These  applications  will  be  discussed  as 
will  the  key  differences  in  approach  required  for 
in  situ  and  for  high-purity  gas  measurement. 
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MB2.1 

10:30am-1 1:00am 


Using  CW-CRDS  for  Trace  Gas  Detection 

John  Dudek,  Paul  Rabinowitz,  and  Kevin  K.  Lehmann 
Department  of  Chemistry,  Princeton  University,  Princeton,  New  Jersey  085 44 

Wen-Bin  Yan 

Meeco  Inc.,  250  Titus  Avenue,  Warrington,  PA  18976 


Abstract 

We  have  developed  a  technique  to  measure  trace  concentrations  of  gases  in  the 
near-infrared  region  using  Continuous  Wave  Excitation  Cavity  Ring-down  Spectroscopy 
(CW-CRDS).  We  have  demonstrated  that  with  a  simple  CW-CRDS  system,  trace 
amounts  of  moisture  (water  vapor)  can  be  accurately  detected. 

Summary 

Detection  of  trace  gas  species  continues  to  be  an  important  application  of  spec¬ 
troscopy.  Absorption  spectroscopy  is  particularly  attractive  for  such  applications 
because  of  its  high  specificity,  high  speed  of  response,  and  absolute  principle  of  cali¬ 
bration,  based  upon  Beer’s  Law.  The  challenge  is  to  reach  ultrahigh  sensitivity  with 
this  method  in  order  to  detect  ultralow  concentrations.  Also,  this  method  should  be 
implemented  in  a  way  that  is  robust,  compact,  and  of  modest  cost  for  a  potential 
sensor.  For  the  past  several  years,  we  have  been  using  continuous  wave  (CW)  Cav¬ 
ity  Ring-down  Spectroscopy  [CW-CRDS]  as  a  tool  to  meet  this  challenge.  We  have 
used  this  technique  to  measure  trace  concentrations  of  gases  in  the  near  infrared  re¬ 
gion  because  this  region  contains  both  vibrational  overtone  transitions  and  forbidden 
electronic  transitions  of  many  atmospherically  important  species.  Our  research  has 
focused  on  investigating  the  properties  of  the  cavity  ring-down  system  in  order  to 
increase' detection  sensitivity  of  these  atmospherically  important  species.  Specifically, 
we  are  studying  moisture  detection  at  the  parts  per  billion  (ppb)  to  the  parts  per 
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trillion  (ppt)  range,  shown  in  figure  1.  We  have  obtained  a  detection  limit  of  better 
than  lxlO-11  cm-1  per  root  Hz  for  R  =0.999985  reflectors  which  allows  us  to  detect 
water  at  less  than  10  ppb. 

CW-CRDS  method  is  limited  to  spectral  regions  where  one  can  obtain  high  reflec¬ 
tivity  dielectric  mirrors,  and  there,  only  over  spectral  tuning  ranges  of  a  few  percent 
for  a  single  set  of  mirrors.  To  overcome  these  limitations,  we  have  designed,  con¬ 
structed,  and  tested  an  astigmatic  prism  ring  resonator  which  utilizes  total  internal 
reflection  to  achieve  high  reflectivity,  a  curved  prism  face  for  optical  stability,  and  re¬ 
flective  coupling  near  Brewster’s  angle  to  provide  controlled  finesse  of  the  resonator. 
We  have  measured  the  loss  of  this  resonator,  which  includes  scattering,  diffraction, 
and  stress  induced  birefringence,  in  the  range  between  470  nm  to  1.6  /im  and  found 
good  agreement  with  calculated  values.  The  minimum  loss  per  pass  excluding  cou¬ 
pling  occurred  at  900  nm  and  was  «  7  x  10-5.  It  was  mainly  due  to  excessive 
diffraction  resulting  from  an  error  in  fabrication  which  we  anticipate  will  be  improved 
significantly  in  a  new  prism. 


Wavelength  (nm) 

Figure  1:  Moisture  Scan  from  1396.1  nm  to  1398.4  nm  at  a  concentration  of  200  ppb. 
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MB2.2  (Invited) 
11:00am-1 1:45am 


Trace  Moisture  Detection  using  Near  Infrared  Diode  Lasers 

Chris  Hovde 

Southwest  Sciences,  Inc.,  Ohio  Operations,  6837  Main  Street,  Cincinnati  OH  45244 

and  Joel  A.  Silver 

Southwest  Sciences,  Inc.  1570  Pacheco  Street,  Suite  E-ll,  Santa  Fe,  NM  87505 

The  measurement  of  trace  concentrations  of  water  vapor  is  import  in  many  industrial  applications, 
especially  semiconductor  fabrication.  A  variety  of  methods  exists  for  measuring  water  vapor  at 
concentrations  approaching  one  nmole/mole  (one  ppb).  Optical  methods  include  Fourier-transform 
spectrometry,  intracavity  laser  absorption  spectroscopy,  and  cavity  ring-down  spectroscopy.  However,  we 
have  chosen  to  focus  on  wavelength  modulation  spectroscopy  (WMS)  as  a  practical  method  to  achieve 
sufficient  sensitivity.1  WMS  is  a  variation  of  transmission  spectroscopy,  and  thus  it  shares  the  calibration 
benefits  of  other  Beer’s  Law  techniques.  By  rapidly  modulating  the  laser  wavelength,  laser  1/f  noise  is 
avoided,  so  very  small  changes  in  laser  power  can  be  measured.  When  WMS  is  combined  with  a  multiple 
pass  absorption  cell,  ultra-sensitive  detection  of  water  vapor  is  possible,  even  when  detecting  H20  using 
weaker  overtone  transitions  with  near-infrared  InGaAsP  diode  lasers  near  1 392.5  nm.  This  approach  has  the 
desired  combination  of  sensitivity  and  accuracy  while  enjoying  the  benefits  of  rugged  communications-type 
diode  lasers.  Other  researchers  have  employed  similar  techniques.2,3 

The  diode  laser  sensor  measures  the  amount  of  water  vapor  by  quantifying  the  absorbance  of  a  single 
rotational  line  within  a  vibrational  overtone  band  of  water  vapor.  To  measure  the  line  profile,  the  laser  drive 
current,  and  hence  its  wavelength  or  frequency  v,  is  ramped  once  per  minute  over  the  absorption  line  and 
simultaneously  modulated  at  frequency /=  50  kHz  with  a  sine  wave.  The  absorption  signal  as  a  function  of 
ramp  position  is  quantified  by  lock-in  detection  at  twice  the  modulation  frequency,  If.  The  water  vapor 
concentration  is  determined  by  linear  least  squares  fitting  of  the  resulting  spectrum  using  a  theoretical  model. 

The  sensor  design  is  based  on  a  sealed  laser  head  connected  to  a  Herriott  multipass  cell  constructed 
from  ultra-high  vacuum  stainless  steel  flanges  sealed  with  copper  gaskets.  The  sample  pressure  is 
maintained  near  13.33  kPa  (100  Torr),  balancing  signal  strength  while  keeping  the  line  shape  sufficiently 
narrow  so  that  the  full  shape  g(v)  can  be  recorded  within  the  wavelength  scan  range  of  the  laser.  The 
transmitted  photocurrent  is  detected  by  an  InGaAs  photodiode;  the  dc  and  2/components  of  the  photocurrent 
are  separately  amplified.  To  suppress  etalons ,  the  laser/lens  assembly  is  mechanically  vibrated4  and  a  jitter 
modulation5  is  used. 

For  low  moisture  levels,  the  absorbed  laser  power  is  small,  and  a  linear  approximation  to  Beer’s  Law 
can  be  used.  The  2/component  of  the  detected  photocurrent  is  proportional  to  the  transmitted  laser  power, 
line  strength,  path  length,  and  water  concentration  (all  of  which  are  constant  over  the  20  ps  modulation  time 
scale),  times  the  2/Fourier  component  of  g(  vm(t)  +  v>)  —  the  WMS  spectral  line  shape  gWMS(  y).  This  linear 
relationship  between  the  small  absorbance  signal  and  the  line  shape  permits  the  use  of  linear  least  squares 
analysis  to  determine  the  best  multiplier  of  the  WMS  line  shape  and  thus  the  best  estimate  of  the  water 
concentration.  The  least  squares  model  includes  five  vectors:  a  numerical  WMS  demodulated  Voigt  line 
shape  gWMf  v),  the  derivative  of  this  demodulated  line  shape  d gWMf  v)/d  v ,  and  three  terms  describing  a 
quadratic  baseline.  The  derivative  term  is  used  for  locking  the  position  of  the  peak  within  the  scan. 

Measurements  using  NIST  Traceable  Standards  The  diode  laser  moisture  sensor  was  used  in  a  series  of 
calibration  measurements  against  the  new  Low  Frost-Point  Humidity  Generator  (LFPG)  at  NIST.6  The 
LFPG  saturates  an  inert  carrier  gas  stream  with  water  vapor  by  flowing  the  gas  over  a  plane  surface  of  ice 
at  fixed  temperature  and  pressure.  Its  output  can  be  predicted  from  thermodynamic  principles  and  is 
independent  of  gas  flow  rate.  The  WMS  instrument  was  calibrated  in  terms  of  the  expected  output  of  the 
LFPG  by  stepping  the  humidity  generator  through  a  series  of  set  points  ranging  from  5  nmol/mol  to 
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2.5  pmol/mol  of  water  vapor  in  air  (Fig.  1).  ^ 

Good  agreement  was  obtained  throughout  this  1 100Q 
range,  although  outgassing  within  the  laser  =g 
sensor  limited  the  accuracy  at  the  lowest  | 
moisture  levels.  "JT 

3  ioo 

The  Allan  variance  provides  a  useful  j£ 
measure  of  the  noise  as  a  function  of  averaging  s 
time  and  shows  that  the  resolution  and  minimum  g 

detectable  concentration  cannot  be  improved  jj  io 
without  limit  simply  by  increasing  the  averaging  8 
time.  Adopting  a  three  sigma  criterion  for 
statistical  significance  (99.7  %  confidence  level), 

the  minimum  detectable  change  in  mole  fraction  Figurel>  Calibration  curve.  Moisture  as  measured  by  the 
of  water  vapor  is  ~0.2  nmol/mol  for  averaging  diode  laser  sensor  vs  moisture  suppHed  by  the  ]ow  frost 

times  of  -20  mm.  point  generator,  showing  good  agreement  throughout  the 

A  useful  figure-of-merit  for  comparing  the  range  studied, 
performance  of  absorption  measuring 

instruments  is  the  noise  equivalent  absorptivity  (NEA),  defined  here  as  the  minimum  detectable  absorptivity 
per  square  root  of  the  data  acquisition  rate.  The  WMS  system  NEA  is  -  3.5*10'9  cm'  '/Hzl/2.  This  NEA 
figure  is  equivalent  to  an  absorbance  of  2T0‘6of  the  laser  power.  These  figures  compare  favorably  to  those 
obtained  with  other  sensitive  absorption  methods  and  are  especially  noteworthy  in  that  they  are  based  upon 
the  rate  at  which  entire  line  profiles  are  recorded  and  fit. 

Conclusions  Diode  laser  absolute  moisture  detection  is  sensitive,  precise  and  accurate.  Using  a  combination 
of  methods  to  suppress  etalons  and  achieve  high  signal/noise,  a  detection  limit  of  -0.2  nmol/mol  can  be 
achieved.  The  frost-point  temperature  equivalent  to  this  detection  limit  is  -117°C.  The  linearity  of  the 
instrument  is  excellent,  exhibiting  an  error  of  only  1%  at  water  concentrations  in  the  pmol/mol  range  (frost 
points  near  -65  °  C),  and  this  range  could  be  improved  by  using  the  full  exponential  form  of  Beer’s  Law.  The 
most  significant  limitation  to  the  accuracy  appears  to  be  from  outgassing  within  the  sample  region. 

Acknowledgment  We  are  grateful  to  James  Whetstone,  Joseph  Hodges  and  Greg  Scace  at  NIST  for  their 
assistance  and  co-operation  in  permitting  us  to  use  the  Low  Frost  Point  Generator,  and  David  Moschella  for 
providing  an  Allan  variance  routine  written  for  LabView.  This  work  was  funded  by  the  U.S.  Department 
of  Commerce  under  Contract  No.  50-DKNB-5-00189. 
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Trace  Moisture  Detection  in  Corrosive  Gases  Using  Quadrupole  Mass  Spectroscopy 


Hans  Funke.  Mark  Raynor,  Virginia  Houlding 
Matheson  Tri-Gas 
Advanced  Technology  Center 

Terry  Ramus,  Scott  Hein 
Diablo  Analytical,  Inc. 

Ralph  Kirk 

Advanced  Technology  Development,  Inc. 


An  online  analyzer  for  impurities  in  corrosive  gases  was  developed  using  an  HP-5973  MSD 
Quadrupole  Mass  Selective  Detector  equipped  with  customized  data  acquisition  and  gas  sample 
inlet  designed  by  Diablo  Analytical,  Inc.  The  instrument  was  evaluated  for  its  ability  to 
quantitatively  analyze  moisture  and  other  impurities  at  the  ppm  level  in  HCI. 


The  sample  gas  is  introduced  at 
atmospheric  pressure  through  a 
50  micron  orifice  into  a  vacuum 
line  system  that  is  pressure 
controlled  with  a  throttle  valve  and 
a  mechanical  pump  to  ~  10 1  torr. 
A  sample  stream  is  introduced 
through  a  1500  micron  orifice  and 
an  air  actuated  valve  into  the  MS 
chamber  for  analysis  (see  Figure 
1).  The  HP  tuning  algorithm  was 
not  applicable  in  the  mass  range 
of  interest  and  manual  tuning  was 
required  for  high  sensitivity. 


Tests  with  ppm  levels  of  oxygen  in 
nitrogen  and  argon  matrix 
demonstrated  a  very  fast 
response  to  changing  inlet 
concentrations  (Figure  2).  The 
instrument  was  sensitive  to  single 
digit  ppm  and  sub  ppm  changes 
and  calibration  curves  were  linear. 
The  sensitivity  was  a  strong 
function  of  the  matrix  gas.  The 
highest  sensitivity  was  obtained  in 
He  whereas  the  heavier  matrix 
gases  decreased  the  sensitivity  by 
about  1-2  orders  magnitude. 


Figure  2:  02  in  N2  matrix 


0-7803-6252-7/00/$10.00©2000  IEEE 


13 


Simiilar  results  were  obtained  in 
HCI  matrix  for  moisture  impurity. 
Moisture  was  measured  at 
m/z=18+19.  Figure  3  illustrates 
the  response  to  moisture 
challenges  of  11,  9,  7,  5,  and  3 
ppm.  The  quadrupole  chamber 
did  not  show  any  visible 
corrosion  during  several  weeks 
of  study.  The  electron  multiplier 
response,  however,  significantly 
decreased  in  the  time  frame  of 
hours  similar  to  the  results  in 
inert  gases. 


Figure  3:  Moisture  in  HCI  matrix  (m/z=18+19) 


Fe  ions  could  be  detected  in  HCI  matrix  in  the  mass  range  above  100  amu,  possibly  due  to 
contamination  of  the  source  gas  (Figure  4).  Also,  formation  of  Fe  species  in  the  source  cannot  be 
completely  ruled  out.  For  example,  Ti  species  were  detected  in  the  gas  phase  with  a  TiN  coated 
source,  indicating  that  at  least  a  fraction  of  the  observed  metal  species  stem  from  the  source. 


Figure  4:  Iron  chloride  species  detected  in  HCI  matrix 
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Application  of  CW-CRDS  to  Monitor  and  Control 
Chemical  Vapor  Deposition 


MB3.2 

2:00pm-2:30pm 


David  S.  Green, a,b  John  P.  Looney, b  Gary  W.  Rubloff a 

a.  Institute  for  Systems  Research,  University  of  Maryland,  College  Park,  MD  20742 
b.  National  Institute  of  Standards  and  Technology,  Process  Measurements  Division,  Gaithersburg,  MD  20899. 

ABSTRACT 

Continuous  wave  excitation  cavity  ring-down  spectroscopy  (CW-CRDS)  is  being  developed  for  real  time 
chemical  sensing  of  volatile  CVD  byproducts  (e.g.  HF,  H  20)  and  as  a  quantitative  and  robust  metrology 
tool  for  advanced  process  control  methodologies. 

SUMMARY 

Real-time  metrology  providing  information  on  reactant  depletion  and  product  generation  for  process 
control  during  semiconductor  manufacturing  is  needed  to  enhance  productivity  and  reduce  costs  [1]. 
While  there  are  many  methods  to  characterize  thin  films,  it  can  be  difficult  to  characterize  processes 
because  no  standardized  measurement  methods  exist  to  quantify  the  effects  of  gas  phase  changes  on 
film  properties.  We  believe  that  continuous  wave  cavity  ring-down  spectroscopy  (CW-CRDS)  [2]  holds 
potential  in  wafer  state  metrology  for  advanced  process  control,  yield  management,  and  quality 
assurance. 

Optical  sensors  exploiting  advances  in  diode  lasers,  coatings,  and  photonics  can  provide  compact, 
rugged  and  reliable  devices  for  in  situ  spectroscopic  sensing.  Absorption  spectroscopy  is  attractive  as  a 
metrology  method  because  it  is  species  specific  and  quantitative.  Process  control  becomes  viable  when 
gas  phase  observations  are  captured  in  real  time  and  combined  with  a  model  incorporating  the  effect  of 
process  parameters. 

Advantages  of  CRDS  over  conventional  laser  absorption  spectroscopy  include  lower  detection  limits,  and 
insensitivity  to  absorption  by  ambient  species  and  to  fluctuations  in  laser  intensity  and  phase.  Also,  the 
sample  volume  in  CRD  systems  can  be  made  relatively  small,  thus  providing  good  spatial  resolution  in 
the  transverse  direction  and  facilitating  integration  into  process  tools.  Finally,  CRD  signals  can  be 
modeled  with  a  relatively  simple  yet  rigorous  and  accurate  theory. 

We  are  investigating  the  quantitative  capabilities  of  CW-CRDS  by  targeting  hydrogen  fluoride,  HF,  and 
water  vapor,  HzO.  The  former  is  a  major  byproduct  occurring  when  excess  fluorine  (in  the  chamber  and 
on  surfaces)  reacts  with  hydrogen-containing  precursors.  H20  is  important  in  fluorine  process 
environments  since  it  is  the  most  common  reactive  impurity  involved  in  gas  phase  and  surface  reactions 
leading  to  the  formation  of  HF  and  other  compounds.  The  capability  to  monitor  the  real-time  evolution  of 
HF  in  a  process  chamber  will  provide  critical  feedback  used  for  system  control.  As  a  specific  example, 
this  has  the  potential  to  provide  wafer  state  reaction  information  that  is  the  basis  for  film  thickness 
metrology  in  CVD  systems. 

In  the  first  stage  of  this  work,  we  are  developing  a  bench-top  CW-CRDS  sensing  platform  to  provide  a 
prototype  system  that  is  compatible  with  semiconductor  manufacturing  applications.  This  prototype  will 
be  integrated  with  a  commercial  thermal  CVD  chamber  that  runs  tungsten  hexafluoride  (WF  6)  chemistries 
to  deposit  blanket  tungsten  films.  The  CRDS  measurements  will  be  used  to  characterize  the  limitations  of 
quadrupole  mass  spectrometry  measurements  of  HF  and  to  provide  wafer  state  reaction  information  that 
is  the  basis  for  film  thickness  metrology  [3].  We  anticipate  that  this  effort  will  help  drive  the  development 
of  several  aspects  of  CW-CRDS  technology  toward  a  reliable  field-deployabie  system. 

Unlike  many  other  applications  of  CRDS  emphasizing  detection  sensitivity,  this  study  is  not  aimed  at  trace 
species  for  fault  detection  or  contamination  control  but  aims  to  monitor  fluctuations  in  the  concentration  of 
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majority  reaction  byproducts  of  the  CVD  process  for  real-time  control.  Operating  conditions  in  the 
commercial  CVD  chamber  are  typically  67  Pa  pressure  and  400  °C  temperature.  Feed  gas  consists  of 
less  than  10%  WF6  by  volume  and  chemical  conversion  rates  of  2%  are  typical.  A  mechanistic  model  for 
W-film  growth  specifies  the  stoichiometry  of  HF  generated  for  each  WF  6  consumed.  Quantitative  analysis 
of  HF  concentration  can  then  be  correlated  to  film  thickness  [3], 

The  test  platform  being  developed  incorporates  a  compact  (nominally  25  cm)  high  finesse  linear  cavity 
encased  in  a  stainless  steel  or  anodized  aluminum  housing  with  O-ring  seals  and  gas  inlet  and  pump 
ports.  Temperature  control,  upstream  flow  control,  and  downstream  pressure  control  are  being 
implemented.  High  quality  dielectric-coated  mirrors  and  custom  mirrors  with  corrosion-resistant  coatings 

are  being  examined.  Pairs  of  high-reflectivity 
(99.995%)  mirrors  coated  at  670  nm  and  685  nm 
are  selected  for  HF  spectroscopy.  Commercially 
available  CW  tunable  diode  lasers  with  nominal 
powers  less  than  30  mW  and  time  averaged 
bandwidth  of  about  20  MHz  are  being  used  as 
the  light  sources  to  pump  the  ring-down  cavity. 
An  optical  isolator  is  used  to  prevent  feedback 
into  the  diode  source.  A  spatial  filter  and  a  mode¬ 
matching  lens  are  used  to  couple  into  a  single 
mode  of  the  ring-down  cavity.  In  one 
embodiment,  an  acousto-optic  modulator  in 
conjunction  with  a  gate  and  delay  generator  and 
a  low  noise  fast  photodiode  receiver  (monitoring 
the  ring-down  signal)  are  used  to  detect  light 
build-up  within  the  cavity  and  fast  switching  to 
induce  the  transient  decay  of  light.  The  signals 
are  digitized  and  processed  on  a  personal 
computer. 

CW-CRDS  can  be  used  to  probe  numerous  absorption  transitions  of  HF  and  water  vapor.  The 
spectroscopic  parameters  of  HF  are  known  and  the  rotational  structure  is  simplistic.  The  line  strengths  at 
room  temperature  for  the  strongest  rotational  features  of  the  HF  (0-4)  vibrational  band  are  estimated  to  be 
~1.2  *io'52  cnrf’cnr^/molecule  at  298K  using  the  data  of  Rimpel  [4],  Calculated  per  pass  losses  in  a  25  cm 
cavity  are  illustrated  in  Fig.  1.  As  indicated  by  this  figure,  for  HF  partial  pressures  greater  than  1  Pa  -10 
Pa,  optical  losses  of  -  10~5  -  10'4  per  pass  are  expected  and  should  be  readily  observable.  Number 
densities  will  be  extracted  from  the  ring-down  signals  and  used  as  input  parameters  to  numerical 
simulations  of  W  deposition. 

CRDS  studies  will  be  part  of  a  larger  view  toward  the  impact  of  optical  sensing  in  semiconductor 
manufacturing  processes.  We  will  use  lessons  learned  from  the  CRDS  experience  as  a  stimulus  for 
identifying  and  assessing  the  viability  of  different  sensor-based  approaches  to  wafer  state  metrology  for 
control.  We  expect  that  the  application  itself  will  initiate  new  ideas  in  CRDS  and  more  generally  in  optical 
absorption  methods. 
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Figure  1.  Estimated  Fractional  Optical  Loss  Per  Pass  for  HF 
absorption  at  670  nm  in  a  25  cm  cavity  at  298K. 
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MB3.3  (Invited) 
2:30pm-3:15pm 


Plasma  Diagnostics  for  Semiconductor  Processing 

Harold  M.  Anderson 

Department  of  Chemical  and  Nuclear  Engineering 
University  of  New  Mexico 
209  Ferris  Engineering  Center 
Albuquerque,  NM  87131  USA 

Frequency-modulated,  infrared  diode  laser  absorption  spectroscopy  (IRLAS)  and  full- 
spectrum  CCD-based  optical  emission  spectroscopy  (OES)  are  two  powerful  new  tools  for 
plasma  diagnostics.  This  presentation  focuses  on  the  use  of  these  diagnostics  in  plasma  etch 
processing  used  for  semiconductor  manufacturing.  In  particular,  IRLAS  is  finding  extensive  use 
in  measuring  absolute  concentrations  of  gas  phase  radicals  formed  in  the  oxide  etch  process 
performed  with  fluorocarbon  high-density  plasma  (HDP)  discharges.  Knowledge  of  the  absolute 
concentrations  of  reactant  species  (CFX)  and  product  species  (SiFx  and  COFx)  formed  in  trace 
amounts  (10!1-1014  cm-3)  during  etching  of  oxide  substrates  is  important  for  predicting  the 
behavior  of  this  complex  plasma- surface  interaction.  These  measurements  are  a  critical  element 
in  a  SEMATECH  sponsored  program  for  plasma  etch  model  development,  and  are  intended  for 
validating  computer  simulation  models  of  this  complex  process.  Full- spectrum  OES  on  the  other 
hand  is  a  more  mature  technology  that  is  finding  application  in  day-to-day  semiconductor  fab 
plasma  processing  operations  like  end-point  and  fault  detection.  With  the  development  of  CCD 
sensors  and  micro- machined  diffraction  gratings,  it  has  been  possible  to  build  miniaturized 
spectrometers  mounted  on  computer  boards  that  allow  rapid  acquisition  of  the  complete  uv- 
visible  optical  emission  spectrum  of  the  plasma.  Chemometric  multivariate  statistical  analysis  of 
this  full- spectrum  database  in  real  time  is  made  possible  by  integrating  the  spectrometer  with  a 
fast  portable  computer.  In  combination,  these  techniques  have  been  shown  to  provide  superior 
sensitivity  for  detecting  process  end-point  in  real-time  and  information  for  process  fault 
detection.  This  capability  is  becoming  more  important  to  the  semiconductor  industry  as  the 
critical  dimensions  of  the  chip  are  approaching  the  extreme  sub-micron  (~0. 1  pm)  range  and  the 
exposed  area  of  the  thin  film  to  be  etched  is  dropping  to  under  one-half  percent  of  the  wafer 
surface. 


The  discussion  of  IRLAS  focuses  on  the  ability  to  detect  trace  amounts  of  radical  species 
important  to  the  oxide  etch  process  with  only  a  single  or  double  pass  using  frequency  modulation 
second  derivative  (or  2f)  spectroscopy.  This  aspect  of  the  diagnostic  tool  design  is  critical  since 
any  plasma  diagnostic  must  be  non-perturbing  to  the  plasma  and  be  accomplished  without 
radically  altering  the  commercial  etch  tool  chamber  configuration.  At  best,  commercial  etch 
tools  provide  only  one  to  two  small  diagnostic  windows  for  viewing  the  plasma  during  operation. 
Practically  speaking  then,  this  limits  the  type  of  optical  diagnostic  data  acquisition  to  emission  or 
line- of- sight  absorption.  Also,  since  space  in  a  semiconductor  clean  room  is  extremely 
expensive,  the  size  or  foot- print  of  the  diagnostic  tool  must  be  compact.  Solid-state  diode  lasers 
operating  in  the  mid-IR  range  (3- 1 2  |im)  meet  all  these  requirements  and  provide  radical 
detection  sensitivity  down  to  number  densities  in  the  1010- 101 1  cm'3  range  given  a  few  passes 
through  the  plasma  chamber  (30-40  cm  diameter)  and  2f  derivative  spectroscopy.  The  diode 
laser  in  the  system  employed  in  this  study  is  cooled  in  a  compact  liquid  nitrogen  dewar  mounted 
on  a  compact  portable  optical  board.  Commercially  available  temperature  and  current 
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controllers  along  with  a  lock-in  amplifier,  oscilloscope  and  PC  computer  are  all  that  are  needed 
for  data  acquisition.  Although  not  suited  for  routine  day-to-day  use  in  a  semiconductor  fab,  this 
type  of  diagnostic  tool  has  been  used  extensively  for  diagnostics  studies  of  plasmas  in 
semiconductor  research  facilities  such  as  those  found  at  SEMATECH,  etch  tool  manufacturer 
development  facilities  and  universities. 

Much  of  this  portion  of  the  presentation  will  focus  on  diagnostics  needed  for  the 
improving  the  performance  of  next  generation,  inductively  coupled,  low  pressure  HDP 
discharges  that  are  being  manufactured  to  meet  the  coming  need  for  extreme  sub-micron  etching. 
The  reactors  typically  operate  with  pressures  in  the  5-10  mTorr  range  and  generate  plasmas  with 
densities  in  the  high  101  cm'3  range.  Radicals,  formed  by  electron  impact  dissociation,  drive  the 
important  surface  reactions  even  though  their  number  densities  are  only  in  the  1012- 1014  cm"3 
range.  The  CFX  radicals  used  in  oxide  etching  are  extremely  important  to  the  process  since  they 
provide  for  the  selectivity  of  the  etch  (i.e.  the  ability  to  stop  etching  the  silicon  underlying  oxide 
film).  This  is  one  of  the  most  critical  etch  steps  in  semiconductor  manufacturing  and  is  used  to 
establish  the  contacts  to  IC  devices.  A  thin  (2-3  nm)  CFX  polymeric  deposited  film  is  shown  to 
be  crucial  for  regulating  the  etch  during  processing.  Monitoring  the  concentrations  of  CFX 
radicals  is  in  turn  shown  to  be  important  to  controlling  the  amount  of  polymer  deposition. 

The  OES  sensor  discussion  focuses  on  the  advantages  of  full  spectrum  CCD-based 
optical  emission  spectroscopy  (OES)  over  monochromator-based  systems  and  the  design  of  a 
compact  computer  integrated  sensor  known  as  the  EP-2000  system  now  manufactured  by  Cetac 
Technologies.  Traditionally,  monochromator  based  systems  have  been  used  to  determine 
endpoint  by  monitoring  one  or  two  strongly  emitting  wavelengths.  For  exposed  open  areas  of 
<1.0%,  a  more  sensitive  approach  is  required  for  the  next  generation  of  chips.  CCD  array 
detector  based  systems  can  provide  a  wealth  of  spectral  information  from  a  variety  of  potentially 
useful  gas  phase  emitting  species.  In  the  case  of  particularly  challenging  applications  such  as 
reverse  mask  shallow  trench  isolation  (STI)  and  contact  etches,  utilization  of  the  full  optical 
emission  spectrum  has  been  shown  to  provide  tangible  benefits.  Production  facility  results 
regarding  these  and  other  demanding  applications  will  be  presented.  The  talk  will  largely  focus 
on  oxide  etching  in  AMAT  MXP  and  AMAT  HDP  platforms.  Evolving  Window  Factor  Analysis 
(EWFA)  and  Multiple  Curve  Resolution  (MCR)  are  the  principal  multivariate  techniques  used  in 
the  analysis.  They  allow  one  to  dynamically  track  the  principal  components  of  the  oxide  etch 
process.  EWFA  is  also  shown  to  useful  for  automatic  fault  detection.  MCR  is  used  to  depict  the 
dynamic  rate  of  formation  (or  depletion)  of  the  principal  chemical  species  in  the  plasma  during 
the  etch.  By  integrating  the  sensor  within  a  computer  and  coupling  the  sensor  to  the  plasma 
emission  via  optical  fibers,  one  is  able  to  collect  and  statistically  analyze  the  full- spectrum  data 
in  a  1-2  hundred  milliseconds.  This  allows  for  sampling  at  5- 10  Hz  rate,  which  in  terms  of  the 
oxide  etch  process  is  effectively  real  time.  The  compactness,  ease  of  use,  reliability  and 
robustness  of  this  type  of  optical  sensor  makes  it  extremely  valuable  to  the  semiconductor 
industry  for  day-to-day  process  control.  Units  are  now  in  use  at  IBM,  Motorola  and  other 
semiconductor  manufacturing  facilities. 
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In  situ  reflectance  monitoring  of  epitaxial  growth 

William  G.  Breiland,  Dept  1126  MS  0601 

Sandia  National  Laboratories,  Albuquerque,  NM  87185-0601 

Epitaxial  growth  of  compound  semiconductors  is  accomplished  with  Metal  Organic  Chemical 
Vapor  Deposition  (MOCVD)  or  Molecular  Beam  Epitaxy  (MBE).  In  both  these  methods,  many 
process  variables  must  be  precisely  controlled  to  achieve  the  proper  combination  of  layer 
thickness,  chemical  composition,  lattice  matching,  and  doping  levels  required  by  modem 
compound  semiconductor  devices.  We  have  found  that  a  simple  and  robust  in  situ  monitor, 
normal  incidence  reflectance,  has  proven  to  be  an  indispensable  tool  for  achieving  stringent 
device  requirements  for  epitaxial  growth. 


The  schematic  on  the  left  of  the  figure  above  shows  the  reflectance  experiment.  Interference  of 
light  between  interfaces  of  the  thin  films  cause  oscillations  in  the  reflectance  signal  as  the  films 
are  grown.  The  monitor  requires  only  one  window,  polarization  is  not  a  factor,  and  the  effects  of 
wafer  wobble  can  be  minimized.  The  screen  print  on  the  right  shows  the  interference  data  from  a 
pre-growth  calibration  run.  These  data  are  analyzed  using  the  ADVISOR  (Analysis  of 
Deposition  Using  Virtual  Interfaces  and  Spectroscopic  Optical  Reflectance)  method  [1],  Growth 
rates  for  each  bubbler  source  and  for  several  doping  conditions  are  determined  to  better  than  1% 
accuracy.  These  numbers  are  fed  to  a  model-based  automated  recipe  generator  that  is  used  by 
the  MOCVD  tool  to  grow  a  device  structure.  Results  of  the  usefulness  of  the  ADVISOR 
approach  are  shown  in  the  figures  below. 
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The  left  figure  shows  the  reproducibility  of  the  cavity  wavelength  values  for  770  and  850  nm 
vertical  cavity  surface  emitting  laser  (VCSEL)  structures  [2].  These  devices  consist  of  nearly 
700  layers.  The  right  figure  shows  the  results  of  real-time  process  control  during  the  MBE 
growth  of  a  complicated  reflectance  modulator  structure  [3].  ADVISOR  analysis  was  done  on 
early  layers  to  modify  the  deposition  rates.  Squares  represent  cavity  wavelengths  that  would 
occur  without  correction.  Circles  represent  the  measured  cavity  wavelength  with  ADVISOR 
control.  Note  the  large  increase  in  device  yield  that  was  achieved  with  the  in  situ  monitor. 


Normal  incidence  reflectance  may  also  be  used  to  determine  surface  temperatures  during  growth. 
A  single-wavelength  pyrometer  will  exhibit  large  (up  to  100  °C)  artifacts  in  temperature  if  it  is 
not  corrected  for  the  change  in  emissivity  that  occurs  during  thin  film  growth.  For  smooth,  flat, 
opaque  surfaces,  the  emissivity  is  simply  (1  -  reflectance).  An  instrument  that  simultaneously 
records  thermal  emission  and  normal  incidence  reflectance  may  thus  be  used  to  perform 
emissivity-correcting  pyrometry  during  thin  film  growth.  The  figure  above  shows  a  plot  of 
reflectance  at  550  nm  (left  scale,  smooth  data)  superimposed  on  the  900  nm  emissivity-corrected 
surface  temperature  (right  scale,  noisy  data)  [4],  Subtle  temperature  effects  may  be  observed 
during  the  growth.  For  example,  a  pause  was  required  between  the  GaAsP  and  GaAs  layers. 
The  figure  reveals  that  the  reason  for  this  pause  was  to  allow  the  system  to  recover  to  the  530  °C 
growth  temperature.  The  figure  also  illustrates  the  high  signal-to-noise  that  may  be  achieved 
with  reflectance  waveforms.  The  layer  labeled  “GaAsSb”  is  a  77  A  quantum  well  between  120 
A  GaAs  barriers. 

Reflectance  has  also  been  used  to  monitor  plasma  etching  on  patterned  wafers.  Unlike  many 
silicon-based  etching  processes,  selective  etch  recipes  are  rare.  It  is  thus  useful  to  watch  the 
time-reversed  interference  pattern  evolve  during  an  etch  process  in  order  to  stop  the  etch  at  the 
correct  interface  or  within  the  correct  layer.  Complications  due  to  photoresist  etching  may  be 
dealt  with  using  digital  filtering  methods. 

1.  W.G.  Breiland  and  K.P.  Killeen,  J.  Appl.  Phys.  78  (1995)  6726. 

2.  W.  G.  Breiland,  H.  Q.  Hou,  H.  C.  Chui,  B.  E.  Hammons,  J.  Cryst.  Growth,  174  (1997)  564 

3.  W.  G.  Breiland,  H.  Q.  Hou,  B.  E.  Hammons,  and  J.  F.  Klem,  XXVIII  SOTAPOCS  Symposium,  Electrochem. 

Soc.  1998. 
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MB4.2 

4:30pm-5:00pm 


AlGaAs  Composition  Measurements  from  In  Situ  Optical  Reflectance* 

K.  A.  Bertness,**  J.  T.  Armstrong,  R.  B.  Marinenko,  L.  H.  Robins,  A.  J.  Paul,  J.  G.  Pellegrino,  P. 
M.  Amirtharaj,  D.  Chandler-Horowitz,  National  Institute  of  Standards  and  Technology, 
**Mailstop  815.04,  325  Broadway,  Boulder,  CO  80303 

We  describe  preliminary  determinations  of  AlGaAs  layer  composition  using  in  situ  optical 
reflectance  spectroscopy  (ORS)  data.  Reflection  high  energy  electron  diffraction  (RHEED) 
oscillations  are  used  to  independently  determine  the  composition  of  the  AlGaAs  layers.  The 
results  are  compared  with  ex  situ  measurements  of  photoluminescence  (PL),  photoreflectance 
(PR),  x-ray  rocking  curves,  and  electron  microprobe  analysis  (EMPA).  Although  additional 
work  is  needed  to  refine  the  uncertainty  estimates  and  reduce  sources  of  error,  we  find  that 
growth  rate  as  measured  by  ORS  agrees  with  RHEED  oscillation  data  to  within  2  %.  The 
ultimate  goal  of  this  project  is  to  produce  standard  reference  materials  (SRMs)  of  certified  alloy 
composition  to  mole-fraction  uncertainty  of  0.002  for  a  range  of  important  III-V  alloys. 

ORS  data  was  acquired  by  reflecting  monochromatic  light  at  near-normal  incidence  from  a 
semiconductor  wafer  mounted  in  the  growth  chamber  of  a  molecular  beam  epitaxy  (MBE) 
system.  The  optical  access  window  was  heated  to  minimize  deposits,  particularly  arsenic. 
Additional  details  have  been  published  elsewhere.1  Before  each  growth  run,  the  ALAs,  GaAs, 
and  AlGaAs  growth  rates  were  measured  separately  using  the  monolayer  oscillations  in  the 
intensity  of  the  zero-order  beam  in  the  RHEED  pattern.  The  A1  mole  fraction  was  calculated  by 
dividing  the  AlAs  growth  rate  by  the  sum  of  the  AlAs  and  GaAs  growth  rates,  resulting  in  an 
estimated  uncertainty  of  0.004  in  the  mole  fraction.  ORS  does  not  measure  the  composition 
directly;  however,  the  growth  rate  measurements  from  RHEED  and  ORS  are  based  on 
independent  physical  processes.  Over  a  large  sample  set,  agreement  between  RHEED  and  ORS 
growth  rates  indicates  that  the  composition  determined  by  RHEED  has  persisted  throughout  the 
run.  The  index  of  refraction  derived  from  ORS  can  also  be  used  to  derive  composition,  although 
our  present  data  set  is  insufficient  for  accurate  determinations. 

Typical  ORS  data  during  growth  of  a  thick  Alo.20Gao.go  As  layer  is  shown  in  Fig.  1,  along  with  a 
theoretical  fit  to  the  data  using  algorithms  based  on  the  virtual-substrate  formalism.2’ 3  The 
variables  optimized  by  the  fit  are  the  index  of  refraction  and  absorption  coefficient  of  the  layer, 
the  growth  rate,  the  effective  index  and  absorption  coefficient  of  the  underlying  layers,  and  an 
overall  reflectance  scaling  factor. 
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Fig.  1  Wafer  reflectance  (black)  at  925  nm  as  a  function  of  time  during  the  growth  of  a  layer  of 
Alo.20Gao.80  As  at  595  °C.  Also  shown  are  the  results  of  an  optimized  fit  (gray)  and  fit  residuals. 
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Fig.  2.  Index  of  refraction  and  ORS  growth  rate  deviation  from  RHEED  oscillation  growth  rate 
for  a  series  of  layers  with  different  Al  mole  fractions. 

Fig.  2  summarizes  two  important  layer  properties  derived  for  a  number  of  growths  with  different 
Al  composition.  The  growth  rate  determined  from  ORS  fits  was  within  1.5  %  of  that  from 
RHEED  for  all  runs.  There  is  a  fairly  large  variation  in  the  index  derived  for  the  set  of  runs  with 
Al  mole  fraction  near  0.20.  Some  variation  originates  with  the  difficulty  the  modeling  encounters 
in  distinguishing  among  changes  in  reflectance  scaling,  growth  rate,  and  index  of  refraction. 

This  set  also  comprises  runs  with  different  doping  levels,  doping  types  and  growth  temperatures 
(from  585  °C  to  600  °C),  all  of  which  can  alter  the  index.  We  expect  that  recent  improvements 
to  the  calibration  of  the  reflectance  scale  will  allow  reduction  of  the  uncertainty. 

In  Fig.  3,  we  compare  the  in  situ  composition  measurement  results  with  the  conventional  ex  situ 
measurement  techniques  of  PL,  PR,  x-ray  diffraction  rocking  curves,  and  EMPA.  More 
specimens  are  needed  to  improve  the  statistical  significance  of  the  data,  but  in  this  preliminary 
set  we  see  that  the  in  situ  measurements  agree  with  each  other  well.  The  EMPA  data  were 
analyzed  using  procedures  developed  specifically  for  this  project.4 
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Fig  3.  Comparison  of  in  situ  and  ex  situ  composition  determinations. 
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Real-Time  Monitoring  of  Epitaxial  Growth 

D.  E.  Aspnes 

Department  of  Physics,  North  Carolina  State  University,  Raleigh,  NC  27695-8202  USA 

ABSTRACT:  Optical  probes  not  only  provide  information  about  microscopic  mechanisms  of  epitaxy 
but  have  also  been  used  for  fully  automatic  sample-driven  closed-loop  feedback  control  of  the  growth  of 
compositionally  graded  materials  and  structures. 


The  drive  toward  higher  performance  and  increasing  complexity  in  semiconductor  devices, 
together  with  the  trend  toward  chemical-beam  deposition  methods  such  as  organometallic  chemical 
vapor  deposition  (OMCVD)  and  chemical  beam  epitaxy  (CBE),  have  provided  strong  incentives  to 
develop  a  better  understanding  of  growth  processes  and  better  methods  of  monitoring  and  even 
controlling  epitaxial  growth.  Optical  probes  are  of  interest  here  because  they  are  nondestructive, 
noninvasive,  and  function  in  relatively  high-pressure  ambients,  including  OMCVD  where  standard 
surface-analysis  probes  cannot  be  used. 

A  number  of  optical  techniques,  such  as  spectroscopic  ellipsometry  (SE),  spectroscopic 
reflectometry  (SR),  surface  photoabsorption  (SPA),  reflectance-difference  (-anisotropy)  spectroscopy 
(RDS/RAS),  p-polarized  reflection  spectroscopy  (PRS),  laser  light  scattering  (LLS),  and  second- 
harmonic  generation  (SHG)  have  been  either  refined  or  invented  to  overcome  the  intrinsic  insensitivity 
of  optical  probes  to  surface  conditions  and  meet  the  challenges  presented  by  monitoring  and  control.1’2 
The  surface-optical  probes  either  rely  on  symmetry  (RDS/RAS,  LLS,  SHG)  or  enhancement  of  surface 
sensitivity  by  operation  at  the  Brewster  (PRS)  or  pseudo-Brewster  (SPA)  angle  to  suppress  the  bulk 
response  in  favor  of  the  surface  component  of  the  total  reflected  intensity. 

However,  the  main  practical  considerations  in  growth  and  deposition  involve  layer  thicknesses  and 
compositions,  which  are  bulk  properties  that  can  only  be  determined  by  bulk  probes  such  as  SR  and  SE. 
SR  is  well  adapted  to  monitor  and  control  deposition  in  optical  thin-films  technology,  such  as  Bragg 
reflectors  for  vertical  cavity  surface-emitting  lasers.3  However,  for  control  purposes  one  needs  not 
average  compositions  but  fluctuations  of  these  compositions  from  target  values,  ideally  of  just-deposited 
material  in  a  region  where  the  thickness  is  ideally  vanishingly  small.  These  near-surface  data  have 
recently  become  available  through  the  development  of  virtual-interface  (V-I)  theory.4  Here,  kinetic 
ellipsometric  (KE)  data  and  their  derivative  with  respect  to  layer  thickness  are  used  to  determine  the 
dielectric  response  of  near-surface  material  down  to  the  monolayer  (ML)  level,  even  if  nothing  is  known 
about  the  underlying  material. 

The  potential  effectiveness  of  various  control  strategies  can  be  assessed  by  considering  the  four 
regions  associated  with  epitaxy:  the  ambient,  surface  reaction  layer  (SRL),  the  near-surface  region 
(NSR),  and  the  bulk.  The  ambient  consists  of  the  carrier  gas  (if  any)  and  the  nutrient  species  providing 
the  constituent  elements  to  the  growth  surface,  including  possibly  thermally  cracked  and  pre-reacted 
species.  The  SRL  contains  both  weakly  (physisorbed)  and  strongly  (chemisorbed)  bound  fractions.  The 
NSR  consists  of  the  most  recently  deposited  material,  and  its  thickness  is  defined  by  instrument 
sensitivity.  Since  the  ambient  feeds  the  SRL  and  the  SRL  feeds  the  NSR,  the  nature  of  the  SRL  is 
determined  by  its  interactions  with  both  ambient  and  NSR.  Although  for  control  purposese  information 
about  all  regions  can  be  used  to  advantage,  it  is  clear  that  the  weakest  and  strongest  links  in  the  control 
process  are  those  involving  the  ambient  and  near-surface  regions,  respectively.  Control  approaches 
based  on  wafer-state  (SRL5  and  NSR6"8)  measurements  also  have  the  advantages  of  bypassing  the  need 
to  accurately  measure  process  parameters  and  to  model  the  intervening  processes  leading  to  species 
incorporation.  In  addition,  they  can  automatically  correct  for  small  changes  as  they  occur,  and  are 
portable,  not  being  affected  by  the  idiosyncracies  of  a  particular  reactor. 
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While  monitoring  provides  useful  information,  the  ultimate  goal  of  real-time  diagnostics  is  wafer- 
driven  closed-loop  feedback  control  of  deposition.  For  structures  such  as  VCSEL  mirror  stacks  that  are 
relatively  thick  optical  fdms,  this  is  equivalent  to  endpoint  detection.  Here,  SR  is  adequate  and  even 
desirable,  since  reflectometers  operating  in  the  relative-reflectance  mode  are  relatively  simple  devices 
and  reflectance  measurements  require  only  normal-incidence  optical  access  to  the  growth  surface, 
minimizing  the  modifications  needed  to  growth  chambers.  For  composition  control,  especially 
involving  sub-nm  films,  it  is  necessary  to  use  SE,  where  both  phase  and  amplitude  are  returned  in  a 
single  measurement.  However,  spectroscopic  eilipsometers  are  considerably  more  complicated  than 
spectroscopic  reflectometers,  and  non-normal-incidence  optical  access  to  the  growth  surface  is  required, 
generally  requiring  in  turn  modifications  of  growth  chambers.  The  recent  development  of  rotating- 
compensator  SE  systems,  replacing  the  older  rotating-analyzer  and  rotating-polarizer  designs,  eliminates 
the  possibility  of  some  systematic  errors  and  allows  for  more  compact  configurations.9 

Holding  compositions  constant  is  the  least  demanding  version  of  wafer-driven  closed-loop 
composition  control,  since  averages  can  be  taken  over  relatively  large  thicknesses.  Using  SE  and  the 
virtual-interface  approach,  InGaAs  compositions  have  been  held  constant  to  within  0.1%  in  a 
multiwafer  production  OMCVD  reactor.10  Control  of  compositionally  graded  materials  requires 
analysis  of  relatively  thin  layers,  and  precision  is  correspondingly  less.  The  Bellcore  group 
demonstrated  ±3%  precision  in  the  growth  of  AlGaAs  structures,  where  the  composition  was 
determined  by  analysis  of  the  outermost  3  A  of  depositing  material.6  Control  of  continuously  graded 
compositions  has  been  demonstrated  in  SiGe  and  InGaP  as  well,  but  reported  precisions  are  lower.7-8 

The  two  groups  who  have  grown  symmetric  compositionally  graded  structures  under  sample- 
driven  closed-loop  feedback  control  found  that  the  flow  of  reactants  at  the  end  of  the  growth  of  such 
structures  is  different  from  that  at  the  beginning.6'8  For  AlGaAs  this  was  attributed  to  the  gettering  of 
triisobutylaluminum,  whereas  for  InGaP  structures  the  difference  appeared  to  be  due  to  the  rate  at  which 
trimethylindium  (a  solid)  is  picked  up  by  the  carrier  gas  as  it  passes  through  the  cell.  It  would  obviously 
be  difficult,  if  not  impossible,  to  model  such  hysteresis  effects.  This  indicates  that  the  accuracy  of  a 
"virtual  reactor",  where  the  properties  of  a  sample  are  predicted  via  computational  models,  must  be 
limited  at  best. 

This  work  was  supported  by  the  Office  of  Naval  Research  and  by  the  Max-Planck-Gesellschaft  via 
an  Award  for  International  Cooperation.  Work  at  NC  State  University  was  done  in  collaboration  with 
K.  A  Bell,  M.  Ebert,  K.  Flock,  G.  D.  Powell,  and  S.-D.  Yoo. 
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We  discuss  how  analytical  tools  in  a  characterization  lab  can  be  used  to  enhance  metrology  tools  in  a  fab.  The 
emphasis  is  on  the  interaction  between  a  characterization  lab  and  a  wafer  fab  belonging  to  the  same  industrial 
company  (as  opposed  to  labs  in  academia,  government  labs,  original  equipment  manufacturers,  or  the  manufacturers 
of  optical  analytical  or  metrology  tools).  Specifically  this  work  will  deal  with  ex  situ  analysis  rather  than  in  situ 
metrology. 


A  metrology  tool  in  a  fab  must  be  clean,  fast,  simple  to  operate,  and  must  be  able  to  measure  small  features.  A 
classical  example  is  a  reflectometry  tool.1  On  the  other  hand,  a  tool  in  a  characterization  lab  can  handle  a  much 
more  complex  sample,  can  take  a  significantly  longer  time  for  the  analysis,  and  often  has  the  luxury  of  using  a 
blanket  wafer.  The  analyst  is  also  available  for  consulting  with  regard  to  optimum  samples  configuration  in  order  to 
generate  the  information  required.  Our  emphasis  will  be  on  using  spectroscopic  ellipsometry  in  a  characterization 
lab  to  develop  optical  constants  of  unusual  material  so  that  this  information  can  be  used  in  a  reflectometry  tool  in  the 
fab. 


As  we  begin,  let  us  consider  the  reflectometry  tool  in  its  mode 
of  operation  where  the  lab  interaction  is  not  needed.  For  this 
example,  which  we  will  call  Example  #0,  we  consider  the 
measurement  of  the  thickness  of  well  known  materials  such  as 
thermal  oxide,  LPCVD  nitride,  or  photoresist  on  a  silicon 
wafer.  The  reflectometry  spectrum  is  shown  to  the  right.  In 
this  case,  the  spectrum  has  lots  of  structure,  the  index  of 
refraction  is  well  understood  and  the  extinction  coefficient  is 
zero.  There  are  no  surprises  here  and  the  reflectometry  tool 
must  simply  determine  the  thickness.  No  involvement  from 
outside  the  fab  is  required.  One  simply  pushes  the  button  and 
gets  the  answer. 


Wavelength  (nm) 


The  next  example.  Example  #1,  is  a  situation  where  a  small 
amount  of  interaction  with  a  characterization  lab  is  needed, 
m  Consider  silicon-rich  nitride.  This  might  be  deposited  by  either 
PECVD  or  LPCVD  and  the  excess  silicon  is  to  be  added  either 
O  to  control  stress  properties  or  to  make  an  anti-reflective  coating. 
CD  The  optical  functions  are  shown  at  left.  These  are  not  fixed,  as 
r*  is  the  case  for  thermal  oxide,  but  depend  on  how  the  material  is 
fabricated.  The  function  of  the  characterization  lab  is  to 
determine  the  optical  constants,  usually  with  a  variable-angle 
spectroscopic  ellipsometer.  A  table  of  the  optical  constants  are 
then  generated  and  fed  into  the  reflectometry  tool  in  the  fab.  At 
that  point,  as  long  as  the  material  is  fabricated  in  the  same  manner,  the  metrology  tool  can  be  used  in  its  normal 
mode,  i.e.  fast,  simple.  We  would  expect  that  the  characterization  lab  would  model  the  film  using  a  dispersion 
equation  such  as  the  Cauchy  equation,  or  with  the  slightly  more  complex  Tauc-Lorentz  equation.2 

For  the  next  example.  Example  #2,  we  consider  a  more  complex  issue.  Optical  techniques  are  not  usually  used  to 
measure  metal  films.  Historically,  metal  films  have  been  thick  enough  to  be  opaque  and  if  the  light  does  not  reach 
the  lower  interface,  having  more  material  does  not  change  the  measured  quantity.  Metal  films  with  thicknesses  less 
than  300A  are  used  in  several  applications,  however,  and  in  this  thickness  range,  the  films  are  not  totally  opaque  and 
can  be  measured  with  an  analytical  or  metrology  tool.  Examples  of  the  use  of  thin  metal  films  are  for  MRAM,3  Flat 
Panel  Displays,  Advanced  Lithography,  etc. 


Wavelength  (nm) 
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The  optical  constants  of  metals  depend  on  the  fabrication  methods;  hence,  one  must  determine  the  optical  constants 
for  the  methods  used  rather  than  to  use  handbook  values.  The  dispersion  relationship  for  metal  films  often  cannot  be 
described  easily  with  an  equation  and  the  values  must  be  determined  for  each  wavelength  used.  The  difficulty  is 
that  for  a  single  film,  the  problem  is  underdetermined  and  there  are  many  combinations  of  thickness  and  optical 
constants  with  will  give  equally  good  fits  to  the  measured  data.  One  of  the  most  powerful  analysis  methods  in 
ellipsometry  is  to  have  several  films  of  the  same  material,  differing  only  in  thickness.  We  asked  the  fabricator  to 
deposit  four  films  with  target  thicknesses  in  the  range  of  50  -  300A.  The  ellipsometric  spectra  and  resulting  optical 
constants  are  shown  below.  Calculated  values  are  also  shown  for  zero  thickness  and  opaque  films. 


Wavelength  (nm) 


Wavelength  (nm) 


Following  the  analysis,  the  optical  constants  are  formatted  for  the  metrology  tool  and  then  used  in  the  reflectometry 
mode.  Note  that  whereas  a  dielectric  film  with  thickness  less  than  100A  does  not  produce  a  response  significantly 
different  from  no  film,  a  metal  film  a  few  tens  of  angstroms  will  change  the  reflectometry  response  significantly. 


As  a  final  example,  we  show  a  possibility  for  quality  control 
in  a  fab.  Let  us  consider  an  amorphous  carbon  film  deposited 
by  PECVD  with  methane,  a  small  amount  of  nitrogen,  and 
argon.  This  material  has  very  little  hydrogen  and  no 
crystalline  structure.  As  deposited,  the  carbon  has  primarily 
sp3  bonding.  When  it  is  annealed,  some  of  the  bonding 
changes  to  sp1 2.  The  resulting  optical  constants  are  shown  to 
the  right  after  annealing  at  temperatures  ranging  from  400  - 
1000°C.  The  material  was  modeled  using  the  Cauchy/Urbach 
dispersion  equations.  It  is  expected  that  in  the  fab,  this 
measurement  would  be  made  with  a  spectroscopic 
ellipsometer  with  a  fixed  number  of  wavelengths  using  a 
CCD  array.  The  function  of  the  characterization  lab  would  be 
to  provide  the  seed  parameters  for  a  Cauchy  analysis.  The 
optical  constants  determined  in  the  fab  would  be  used  for 
quality  control  of  the  annealed  material. 


Wavelength  (nm) 
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2.  G.  E.  Jellison,  Jr.  and  F.  A.  Modine,  Appl.  Phys.  Lett.,  69,  371  (1966),  Erratum,  Appl.  Pys.  Lett,  69,  2137  (1996). 

3.  H.  G.  Tompkins,  T.  Zhu,  and  E.  Chen,  J.  Vac.  Sci.  &  Technol.  A,  16,  1297  (1998). 
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Optical  Sensing  Opportunities  In  Production  MBE 

Keith  R.  Evans  and  Steven  P.  Roth 
IQE  Incorporated 
1 19  Technology  Drive 
Bethlehem,  Pennsylvania  18015 
Tel/Fax:  610-861-6930/5273 
Email:  kevans@iqep.com  and  sroth@iqep.com 

MBE  has  grown  from  a  fundamental  research  tool  for  investigating  gas- solid  interactions 
in  the  1970’s,  to  a  basic  R&D  tool  for  developing  advanced  semiconductor  device 
technologies  in  the  1980’s,  to  a  highly  successful  commercial  manufacturing  tool  in  the 
1990’s.  The  world  market  for  ni-V  devices  based  on  MBE  and  MOVPE  wafers  in  2000 
is  estimated  to  be  nearly  $9  billion.  We  estimate  that  nearly  1  million  MBE  wafers  will 
be  used  in  2000  for  a  variety  of  commercial  applications  including  those  associated  with 
optical  data  storage,  mobile  telephony,  satellite  communications,  fiber  optic 
communications,  and  wireless  communications. 

Success  brings  new  challenges.  The  large  and  growing  demand  for  MBE  wafers  required 
for  current  commercial  applications  and  for  development  of  future  applications  brings 
with  it  a  concomitant  need  to  better  wafers  more  rapidly  and  more  cheaply  (hence  the 
term  “better,  faster,  cheaper”).  Customers  not  only  want  low-priced  high-quality  wafers 
delivered  on  time;  they  want  reduced  delivery  times,  reduced  wafer  property  variability, 
and  continuous  price  reductions  over  time  and  volume.  Additionally,  as  device  and 
circuit  designers  strive  to  get  greater  performance  from  a  given  technology,  there  is  a 
tendency  to  narrow  wafer  specification  tolerances. 

Standard  new  generation  high  volume  MBE  systems  utilize  optical  sensors  for  platen 
position  sensing  and  transport  but  not  in  growth  parameter  sensing.  Still,  standard  new 
generation  MBE  systems  provide  significant  benefits  towards  realizing  better,  faster,  and 
cheaper  wafers.  Wafers  are  produced  faster  in  large  part  because  of  the  increased  number 
of  wafers  grown  at  one  time  (higher  throughput),  and  more  cheaply  because  of  reduced 
manpower  needs  (per  wafer)  and  also  because  of  higher  yields  (less  scrap).  Wafer  platen 
transfers  are  automated  thus  minimizing  human  induced  variations.  Wafers  are  made 
“better”  primarily  due  to  reduced  defects,  improved  flux  stability,  and  improved 
uniformities.  Such  improvements  brought  about  by  new  generation  reactors  are  primarily 
due  only  to  scaling  up  the  size  of  the  reactor.  Defects  are  lower  due  to  increased  source 
substrate  distance  and  modest  improvements  in  source  design.  Increased  flux  stability 
results  primarily  from  use  of  larger  cells,  which  have  larger  thermal  masses.  Fluxes  are 
still  initially  set  using  ion  gauges  together  with  flux  vs.  source  thermocouple  setpoint 
curves  with  thermocouple  setpoints  further  adjusted  based  on  ex- situ  wafer 
characterization  results.  Calibration  wafers  and  yielded  device  wafers  (i.e.,  those  that  fall 
outside  of  wafer  specification  tolerances)  add  significant  cost  to  the  MBE  wafer 
production  effort. 
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Product  mix  is  an  important  variable  in  assessing  opportunities  for  quality  enhancement 
and  cost  reduction.  When  MBE  production  systems  are  used  to  produce  different 
products  within  a  campaign,  it  is  necessary  to  calibrate  and  qualify  the  system  for  each 
new  product  lot  upon  completion  of  the  growth  of  the  last.  The  ability  to  move  from 
product  to  product  as  efficiently  as  possible  suggests  a  potentially  greater  role  for  optical 
sensors  in  a  high  product  mix  scenario. 

Optical  sensing  has  the  potential  to  enable  significant  cost  reduction  and/or  quality 
enhancement  via:  1)  the  ability  to  impact  wafer  yield  (number  of  wafers  meeting 
customers  spec  divided  by  the  total  number  of  wafers  grown),  and  2)  wafer  throughput 
(rate  of  producing  wafers  that  meet  customer  spec).  Improvements  in  yield  and 
throughput  generally  positively  impact  delivery  times  as  well.  Key  technological  and 
capacity  issues  that  directly  impact  MBE  process  yield  and  throughput  and  for  which 
optical  sensors  could  play  a  role  include: 

-  Vacuum  integrity 

-  Reproducibility  and  reliability  of  source  shutters 

-  Lateral  and  temporal  variation  in  wafer  temperature 

-  Substrate  surface  cleanliness,  crystallinity,  and  morphology 

-  Operational  efficiency  of  major  heat  removal  mechanisms  (e.g.,  water  and  LN2) 

-  Buildup  of  source  materials  and  coupling  with  mechanical  disturbances  which 
could  affect  wafer  defect  formation  via  particulate  generation 

-  Source  flux  compositional  makeup  including  impurities 

-  Lateral  and  temporal  variation  in  incident  flux 

-  Source  depletion 

MBE  wafer  suppliers  naturally  focus  cost  reduction  and  quality  enhancement  efforts  on 
opportunities  for  which  the  largest  potential  return  on  investment  (ROI)  exists;  not  all  of 
such  opportunities  will  benefit  from  optical  sensing.  To  calculate  ROI  for  sensor 
implementation,  costs  and  benefits  must  be  understood  in  detail.  Detailed  and  accurate 
ROI  calculations  require  access  to  MBE  growth  yield  and  throughput  data  taken  under 
appropriately  controlled  conditions  which  most  closely  match  those  of  the  MBE 
production  system  under  consideration  and  for  the  associated  manufacturing  scenario. 
Such  data  largely  does  not  exist  today  for  standard  new  generation  MBE  systems  yet 
must  be  generated  before  ROI  analysis  can  be  completed.  Each  party  -  optical  sensor 
supplier,  MBE  system  supplier,  and  MBE  wafer  supplier  -  is  motivated  to  understand  the 
benefits  of  optical  sensing.  However,  the  costs  of  the  underlying  experiments  are  high 
and  significant  issues  of  intellectual  property,  confidentiality,  and  propriety  are  present, 
all  of  which  present  barriers  to  developing  a  useful  ROI  analysis.  For  these  reasons  it  is 
proposed  that  a  strategic  partnership  be  formed  between  optical  sensor  supplier,  MBE 
wafer  supplier,  and  MBE  system  supplier,  which  balances  issues  of  development,  costs, 
intellectual  property,  confidentiality  and  propriety.  Such  a  partnership  provides  a  vehicle 
for  carrying  out  maximally  relevant  experiments  that  in  turn  will  lead  to  a  cost  benefit 
analysis  that  is  appropriate  for  taking  optimal  decisions  regarding  implementation  of 
optical  sensors  in  next  generation  standard  MBE  production  systems. 
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In  Situ  Raman  Spectroscopic  Characterization  of  Compound  Semiconductor  Free  Carrier 

Concentration 

J.  E.  Maslar 

Chemical  Science  and  Technology  Laboratory,  National  Institute  of  Standards  and  Technology 

100  Bureau  Drive,  Stop  8360 
Gaithersburg,  MD  20899-8360 

C.  A.  Wang  and  D.C.  Oakley 

Lincoln  Laboratory,  Massachusetts  Institute  of  Technology 
244  Wood  Street 
Lexington,  MA  02420-9108 

Raman  spectroscopy  has  been  utilized  with  great  success  for  ex  situ  characterization  of  III-V 
semiconductor  epitaxial  layers  since  it  provides  information  about  properties  such  as  alloy 
composition,  lattice  mismatch,  and  crystallinity,  as  well  as  free  carrier  concentration  and 
mobility.[l]  However,  it  has  been  utilized  infrequently  for  in  situ  characterization  because  of  the 
low  sensitivity,  high  cost,  and  large  footprint  of  such  systems.  More  recently,  developments  in 
detectors,  filters,  and  laser  technology  now  make  Raman  spectroscopy  more  attractive  as  an  in 
situ  probe.  The  objective  of  this  work  is  to  develop  and  evaluate  Raman  spectroscopy  as  an  in 
situ,  spatially  resolved  probe  of  compound  semiconductor  electrical  properties  for  use  in  process 
monitoring  and  control. 

In  the  first  phase  of  this  investigation,  ex  situ  spectra  of  doped  and  undoped  films  of  GaSb, 

GaAs,  InGaAs,  and  InP  were  measured.  Such  semiconductor  materials  were  selected  because 
the  different  materials  properties  present  different  challenges  in  the  interpretation  of  the  Raman 
spectra.  Spectra  were  generated  in  a  near-backscattering  geometry  with  radiation  from  argon  and 
krypton  ion  lasers,  appropriate  holographic  filters,  a  0.46  m  imaging  spectrograph,  and  a  back- 
illuminated,  thermo-electrically  cooled,  charge  coupled  device  camera.  Figure  1  shows  ex  situ 
Raman  spectra  at  room  temperature  of  two  InGaAs  layers  with  carrier  concentrations  of 
8  x  1017  cm'3  and  3  x  1018  cm3  as  determined  by  electrical  measurements.  The  coupled  phonon- 
plasmon  mode  is  easily  observed  in  each  spectrum.  The  frequency  of  this  mode  increases  with 
increasing  free  carrier  concentration  and  provides  the  basis  for  optical  determination  of  carrier 
concentration. 

In  contrast  to  Hall  measurements,  Raman  spectroscopy  does  not  require  fabrication  of  electrical 
contacts  and  can  be  performed  on  conducting  substrates.  This  latter  aspect  of  Raman 
spectroscopy  is  particularly  important  when  measuring  carrier  concentrations  of  materials  for 
which  semi-insulating  substrates  are  difficult  to  obtain  such  as  for  GaSb.  In  this  investigation, 
the  carrier  concentrations  of  doped  GaSb  on  semi-insulating  GaSb  substrates  were  measured. 
This  is  significant  because,  while  actual  GaSb-based  device  structures  are  grown  on  GaSb 
substrates,  test  structures  for  GaSb  carrier  concentration  and  mobility  measurements  are  grown 
on  semi-insulating  GaAs  substrates.  The  assumption  is  made  that  the  carrier  concentration  and 
mobility  will  be  identical  for  layers  of  nominally  identical  dopant  concentration  regardless  of 
substrate.  Our  Raman  measurements  show  that  this  is  not  always  the  case. 
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In  the  second  phase  of  this  investigation,  the  effects  of  heating  the  films  will  be  examined  by 
making  in  situ  measurements  on  selected  well  characterized  epitaxial  layers.  The  films,  which 
have  previously  been  grown,  will  be  placed  in  an  optically-accessible,  vertical  flow,  rotating-disk 
thermal  chemical  vapor  deposition  reactor,  and  Raman  spectra  will  be  recorded  from  room 
temperature  up  to  typical  growth  temperatures  of  around  600  °C.  The  reactor  will  be  maintained 
under  typical  conditions  for  deposition,  150  torr  of  helium,  although  no  growth  will  be 
performed. 

[1].  B.  Prevot  and  J.  Wagner,  “Raman  Characterization  of  Semiconducting  Materials  and 
Related  Structures,”  Prog.  Crystal  Growth  and  Charact.  1991,  Vol.  22,  pp.  245-319. 


Figure  1 .  Ex  situ  Raman  spectra  at  room  temperature  of  two  InGaAs  layers  with  different  free 
carrier  concentrations. 
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TuB2.3 

11:30am-1 2:00pm 


TURNKEY  INDUSTRIAL  INSTRUMENTATION  FOR  FAST-RESPONSE,  ON-LINE  ANALYSIS  OF  PPB 
IMPURITIES  IN  THE  ELECTRONIC  SEMICONDUCTOR  GASES 

Richard  T.  Mever.  Jorge  E.  Perez,  and  Shawn  Strickland,  CIC  Photonics,  Inc.,  3825  OsunaRd 
NE,  #6,  Albuquerque,  NM  87109,  505-343-9500,  www.ireas.com 

A  totally  integrated  hardware  and  software  gas  analysis  system,  IRGAS™,  is  newly  available  to 
the  semiconductor  process  engineer  to  monitor  the  ppb  quality  level  of  the  corrosive  and  toxic 
cleaning  and  etching  gases  in  real-time  and  on-line  with  process  tools  in  a  fab  plant.  The 
IRGAS™  system  is  based  upon  an  industrial-ruggedized  FTIR  spectrometer  coupled  with  a 
nickel-plated  stainless-steel  long-path-gas-cell  containing  stainless-steel  mirrors  coated  with  a 
heavy-layer  of  reflective-gold  over  a  thick-nickel  substrate  for  chemical  corrosion  resistance.  An 
electronically-operated  gas  handling  manifold,  pumping  system,  and  computer  display  package 
are  provided  within  a  purgeable  stainless-steel  enclosure  for  user  safety  protection;  see  Figure  1. 
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Figure  1  Block  Diagram  of  IRGASTM  System  Components  Managed  by  SPGAS™  Software 


The  SPGAS™  software  provides  both  (1)  total  operational  management  of  all  hardware 
components  and  (2)  an  innovative  weighted  multi-band,  multi-component  classical  least  squares 
analysis  of  impurity  gases  from  an  on-board  set  of  gas  calibration  data.  The  SPGAS™  permits 
the  process  engineer  to  monitor  any  number  of  impurities  simultaneously  with  preset  ppb  ranges 
specified  for  normal,  alert,  and  out-of-control  indicators,  alarms,  or  signals  to  a  master  computer; 
see  Figure  2  for  a  typical  screen  display.  Both  real-time  and  past-time  readouts  and  records  are 
also  available  for  gas  cell  temperature,  pressure,  and  species  concentrations. 
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Figure  2  Functional  Schematic  of  SPGAST.M  Methodology  for  Impurity  Gas  Analysis 

Also  available  is  the  SpectraStream™  algorithm  set  which  provides  for  immediate  early-detection 
of  changes  in  impurity  levels  within  seconds.  This  feature  actually  generates  two  time-lines  of 
output  from  the  FTIR  spectrometer:  the  standard  multi-scan  analysis  which  generates  low-noise 
data  on  the  time-scale  of  several  minutes  and  a  unique  scan-by-scan  analysis  which  displays  data 
within  seconds  of  actual  time.  This  fast-response  feature  enables  the  user  to  alter  or  stop  a 
process  operation  before  major  losses  of  product  occur.  Figure  3  shows  a  simulation  for  the  early 
detection  of  moisture  as  an  impurity. 
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Figure  3  Illustration  of  Fast-Response  Detection  of  Moisture  by  SpectraStream™ 

This  turnkey  system,  which  is  available  in  four  configurations,  can  be  applied  at  multiple  points 
of  critical  gas  purity  management  within  a  semiconductor  wafer  manufacturing  plant. 


TuB3.1  (Invited) 
1:30pm-2:15pm 


Nonlinear  optics  and  spectroscopic  ellipsometry  as  complementary  sensors  to  monitor 

and  control  SiGe  growth 

L.  Mantese1,  P.T.  Wilson1,  K.  Selinidis2,  D.  Lim!,  Y.  Jiang1,  J.D.  Canterbury 1 ,  J.G. 

Ekerdt \  M.  C.  Downer 1 

department  of  Physics,  University  of  Texas,  Austin,  TX787J2 
2 Department  of  Chemical  Engineering,  University  of  Texas,  Austin,  TX  78712 

Due  to  their  unique  sensitivity  to  surfaces  and  interfaces,  nonlinear  optical  techniques 
such  as  second  harmonic  (SH)  and  sum  frequency  generation  (SFG)  have  emerged  as 
powerful  and  highly  versatile  spectroscopic  probes.  With  the  recent  availability  of 
commercial,  tunable,  femtosecond  laser  systems,  in-situ,  real-time  monitoring  of  the 
optical  SH  responses  of  Si(001)  and  SiGe(OOl)  systems  has  become  possible.  Such 
investigations  have  included  single-wavelength  SHG  monitoring  of  CVD  growth 
chemistry,  including  H  coverage  and  desorption  at  Si(001)  in  real-time  [1,2]. 
Spectroscopic  SH  information  around  the  Ei  region  of  Si  has  demonstrated  adsorbate- 
specific  sensitivity  to  ML  coverages  (Fig.  1). 


H/Si(001)  Ge/Si(001)  B/Si(001) 


Figure  1 .  SH  spectra  of  ML  and  sub  ML-covered  2xl-Si(001). 

The  Ei  peak  of  Si  shifts  with  increasing  Ge  composition,  as  observed  in  both  SH  spectra 
[3]  and  spectroscopic  ellipsometry  (SE).  This  information  can  be  exploited  for  growth 
control,  by  calibrating  the  dielectric  function  of  SiGe  with  Ge  composition  and  using  the 
virtual  substrate  approximation  (VSA)  developed  by  Aspnes  [4,5].  Real-time  growth 
control  of  compositionally-graded  SixGei_x  has  been  demonstrated  using  SE  [6].  A 
similar  approach  is  suggested  by  the  sensitivity  of  SH  to  Ge  composition  (Fig.  2). 
However,  the  SH  spectra  acquired  in  several  (~10)  minutes  by  laser  tuning  are 
unacceptable  for  real-time  growth  applications.  To  overcome  this  problem,  Wilson  et  al. 

[7]  have  developed  a  technique  for  rapid  (~1  sec)  spectroscopic  detection  of  broadband 
SH  spectra  generated  by  a  ~10  fs  (~90  nm  bandwidth)  laser  pulse.  Real-time 
measurement  and  growth  control  using  spectroscopic  SH  is  thus  feasible. 

In  addition,  the  SH  responses  of  doped  Si(001)  samples  have  been  investigated 

[8]  and  shown  to  be  uniquely  sensitive  to  bulk  B  doping.  Upon  in-situ  B  doping  to 
~5xl018/cm3,  the  SH  signal  increases  by  about  five  times  and  the  peak  blue  shifts  toward 
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3.4  eV.  An  issue  of  immediate  technological  concern  that  SH  may  prove  able  to  address 
is  A-scale  gate  dielectric  information.  Erley  and  Daum’s  [9]  SH  results  from 


2  Photon  Energy  (eV) 


Figure  1.  Left  panels:  Linear  growth  profile  from  15%  to  35%  Ge  composition  using  SE  and  VSA 
analysis  with  feedback  control  of  the  disilane  valve.  The  inset  corresponds  to  the  disilane  and 
germane  valve  positions  during  growth.  The  lower  left  panel  shows  corresponding  post-growth 
SIMS  data  for  verification.  Right  panels:  SH  responses  of  SiGe  with  increasing  Ge  composition. 

The  thin-solid  lineshapes  were  measured  using  a  tuned  laser  output  (acquisition  time  ~10  min)  and 
the  heavy-dashed  lineshapes  measured  using  a  broadband  laser  source  (acquisition  time  ~lsec) 

Si02/Si  surfaces  show  a  strongly  process-dependent  response  near  3. 6-3. 8  eV.  A  similar 

investigation  of  alternative,  novel  dielectrics  using  SFG  is  under  investigation. 

[1]  J.  I.  Dadap,  N.M.  Russell,  Z.  Xu,  X.F.  Hu,  O.A.  Aktsipetrov,  J.G.  Ekerdt  and  M.C. 

Downer,  Phys.  Rev.  B  56  (1997)  1. 

[2]  X.F.  Hu,  Z.  Xu,  D.  Lim,  M.C.  Downer,  P.S.  Parkinson,  B.  Gong,  G.  Hess,  and 

J.G.  Ekerdt,  Appl.  Phys.  Lett.  71  (1997)  1376. 

[3]  P.  Parkinson,  D.  Lim,  J.  G.  Ekerdt,  and  M.  C.  Downer,  Appl.  Phys.  B  68  (1999)  1 . 

[4]  D.  E.  Aspnes,  W.  E.  Quinn,  M.  C.  Tamargo,  M.  A.  A.  Pudensi,  S.  A.  Schwarz,  M.  J. 

S.  P.  Brasil,  R.  E.  Nahory  and  S.  Gregory,  Appl.  Phys.  Lett.  60,  (1992)  1244. 

[5]  D.  E.  Aspnes,  Jour.  Opt.  Soc.  Am.  A  10  (1993)  974. 

[6]  L.  Mantese,  K.  Selinidis,  P.T.  Wilson,  D.  Lim,  Y.  Jiang,  J.G.  Ekerdt  and  M.C. 

Downer,  Appl.  Surf.  Sci.  154-155  (2000)  229. 

[7]  P.T.  Wilson,  Y.  Jiang,  O.A.  Aktsipetrov,  E.D.  Mishina,  and  M.C.  Downer,  Opt.  Lett. 

24(1999)496. 

[8]  D.  Lim,  M.C.  Downer,  J.G.  Ekerdt,  N.  Arzate,  B.S.  Mendoza,  V.I.  Gavrilenko  and 

R.Q.  Wu,  Phys.  Rev.  Lett.  (2000)  in  press. 

[9]  G.  Erley  and  W.  Daum,  Phys.  Rev.  B  58  (1998)  R1734. 
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2:15pm-3:00pm 

SCATTEROMETRY  APPLIED  TO  MICROELECTRONICS  PROCESSING 


John  R.  (Bob)  McNeil 
Center  for  High  Technology  Materials 
Univ.  of  New  Mexico 
Albuquerque,  NM  87131 


In  the  manufacture  of  semiconductor  devices,  storage  media,  and  in  other  areas,  the  creation  of  structures 
manufactured  to  pre-defined  dimensional  tolerances  is  required.  It  is  therefore  essential  to  have  the  ability  to  obtain 
a  precise  as  well  as  accurate  measurement  of  the  dimensions  of  these  manufactured  structures.  Precision  refers  to  the 
ability  to  repeatedly  obtain  the  same  dimensional  measurement  while  accuracy  is  concerned  with  the  traceability  of 
the  measurement  to  a  fundamental  dimensional  standard.  Dimensional  metrology  of  semiconductor  structures  has,  to 
date,  primarily  been  accomplished  through  the  formation  of  an  image  of  the  structure.  The  image  may  be  recorded, 
printed  or  displayed  using  a  plethora  of  devices  currently  available  for  this  purpose.  The  formation  of  an  image 
requires  the  recording  of  the  interaction  of  an  incident  radiation  or  probe  with  the  object  whose  image  is  required. 

This  past  decade  has  also  seen  the  development  of  a  non-imaging  optical  dimensional  metrology  technique 
known  as  scatterometry.  Scatterometry  is  a  non-destructive  optical  technique  that  records  and  analyzes  changes  in 
the  intensity  of  light  reflected  from  a  periodic  scattering  surface.  By  measuring  and  analyzing  the  light  diffracted 
from  a  patterned  periodic  sample,  the  dimensions  of  the  sample  itself  can  be  measured.  Scatterometry  exploits  the 
sensitivity  of  diffraction  from  a  sample  to  changes  in  the  line-shape  of  the  sample.  The  use  of  an  elementary  sample 
illumination  system,  combined  with  a  rigorous  scattered  signal  analysis  procedure  presents  a  significant  advantage 
over  conventional  metrology  techniques.  These  will  be  discussed  in  detail  in  this  paper. 

Applications  of  scatterometry  technique  have  included  characterization  of  photomasks';  monitoring  stepper 
focus2  and  dose3;  and  the  photoresist  post  exposure  bake  process4;  and  even  the  characterization  of  three- 
dimensional  features  such  as  contact  hole  and  DRAM  arrays5.  The  most  widely  reported  scatterometry  data  have 
been  for  measurements  of  developed  photoresist  and  etched  poly-Si  gratings6.  Like  other  optical  metrologies, 
scatterometry  measurements  are  rapid,  non-destructive  and  highly  repeatable. 

Although  the  types  of  features  mentioned  above  represent  a  considerable  portion  of  a  typical  process,  other 
types  of  materials,  such  as  metal  layers,  are  also  commonly  present,  and  are  important  to  characterize.  Recent 
scatterometry  measurement  data  from  metal  layers  will  be  presented;  see  Figure  1  below.  The  scatterometer  used  in 
this  investigation  was  the  so-called  2-©  variety,  where  the  scatter  signature  is  obtained  by  measuring  the  diffraction 
efficiency  of  a  particular  order  as  the  incident  angle  is  varied.  The  analysis  involved  comparing  the  measured  data  to 
a  library  of  theoretical  scatter  signatures  (generated  a  priori  from  rigorous  coupled  wave  theory)  to  find  the  best 
match.  The  scatterometer  measurements  agreed  well  with  AFM  measurements  performed  on  the  same  samples. 
Details  will  be  presented. 


350  A  TiN 
6000  A  A1C 

250  A  Ti 
3000  A  oxid 

Si  Substrate 


Figure  1.  The  post-etch  and  strip  stack  measured  in  this  study.  Note  the 
etch  process  removes  some  of  the  oxide  layer. 
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Another  primary  consideration  of  a  metrology  technique  is  the  measurement  repeatability.  The 
scatterometer  apparatus  is  simple,  consisting  of  a  laser  or  white  light  illumination  arrangement  and  a  simple 
detection  arrangement.  Because  of  this  simplicity,  the  technique  is  inherently  repeatable.  This  is  illustrated  below 
in  Table  1  for  the  AlCu  sample  discussed. 


Parameter 

name 

Repeatability 

(6-sigma) 

Reproducibility 

(6-sigma) 

Precision 

(6-sigma) 

Linewidth 

0.13  nm 

0.45  nm 

0.47  nm 

AlCu  Thk 

2.0  A 

2.36  A 

3.09  A 

Etched  Ox  Thk 

1.61  A 

1.70  A 

2.34  A 

UnifOx  Thk 

2.0  A 

2.02  A 

2.84  A 

Table  1.  Measurement  precision  data  for  the  etched  metal  samples. 


Measurement  precision  will  also  be  discussed.  Precision  is  dependent  upon  two  issues:  sensitivity  of  the 
measured  quantity  to  changes  in  sample  features,  such  as  how  the  measured  intensity  changes  with  changes  in  line- 
width,  and  the  measurement  precision  of  the  quantity  being  measured..  The  latter  is  dependent  on  the  particular 
scatterometer  technique  employed,  and  this  will  be  discussed. 
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Accuracy  and  Precision  of  Scatterometer  Measurements  Relative  to 
Conventional  CD  Metrology 

T.  Hayes  *,  R.  Bowley  *,  M.  Littau  +,  and  C.  Raymond  + 

IBM  Microelectronics  Division,  Essex  Junction,  VT  05452 
+  Bio-Rad  CD  Systems,  Albuquerque,  NM  87109 

Abstract 

Scatterometery  is  an  emerging  metrology  technique  for  measuring  critical  dimensions 
(CD),  profiles,  and  thicknesses  within  diffraction  gratings  printed  on  semiconductor  film  stacks. 
A  20  scatterometer  was  used  to  measure  diffraction  gratings  on  two  films  stacks:  1)  Front  end  of 
line  logic  gate  stack,  and  2)  Resist  on  anti-reflective  coating  (ARC)  on  Silicon.  The  CD 
measurements  of  the  scatterometer  were  compared  to  critical  dimension  scanning  electron 
microscope  (CD-SEM),  atomic  force  microscope  (AFM),  and  cross-section  scanning  electron 
microscope  (X-SEM)  measurements.  The  correlation  between  scatterometer  and  CD-SEM 
measurements  was  analyzed  using  the  Mandel  technique  of  Archie  and  Banke1  where  AFM  and 
X-SEM  measurements  were  used  to  characterize  the  “real”  properties  of  the  grating.  Initial 
results  indicated  that  the  measured  CD  value  by  scatterometer  and  CD-SEM  were  linearly 
correlated  with  a  sizeable  offset.  Further  analysis  of  the  correlation  shows  that  the  offset  was 
pronounced  at  grating  sidewall  angles  approaching  90°.  The  effect  of  sidewall  angle  on  CD- 
SEM  to  scatterometer  correlation  and  results  for  precision  and  accuracy  on  the  scatterometer  are 
discussed  relative  to  in-line  stepper  focus  control. 

Summary 

Semiconductor  device  performance  is  controlled  by  the  critical  dimension  of  features 
printed  by  lithographic  and  etch  processes.  Conventionally,  the  critical  dimension  scanning 
electron  microscope  (CD-SEM)  is  used  for  process  control;  however,  the  scatterometer  is  another 
technique  that  has  recently  become  available.  The  two  systems  work  on  entirely  different 
principles. 

The  CD-SEM  uses  a  focused  electron  beam  that  is  rastered  over  an  area  of  interest,  where 
by  secondary  and  backscattered  electrons  are  generated  everywhere  the  beam  imparts  the 
sample2.  The  generated  electrons  are  then  collected,  giving  a  waveform  (1-D)  or  image  (2-D)  of 
intensity  versus  position  which  can  be  analyzed  using  different  algorithms  to  extract  critical 
dimensions3.  The  operating  conditions  of  the  CD-SEM  are  such  that  higher  magnifications  are 
used  for  critical  dimension  measurements;  therefore,  measurements  are  performed  over  small 
segments  of  device  features. 

The  scatterometer  works  on  the  principle  of  light  diffracted  from  an  array  of  features.4,5 
At  several  angles  the  20  scatterometer  measures  the  intensity  of  the  0th  order  diffraction  intensity, 
which  will  vary  depending  on  the  film  stack  and  diffraction  grating  spacing.6  Rigorous  modeling 
of  the  optical  behavior  of  the  grating  as  a  function  of  angle  can  be  used  to  predict  the  intensity 
profile  for  most  film  stacks  and  grating  spacings.  By  comparing  the  measured  intensities  to  the 
modeled  data,  film  thickness,  critical  dimension,  and  sidewall  profile  information  can  be 
determined  using  the  scatterometer.  Due  to  the  laser  spot  size  and  other  optical  considerations, 
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the  scatterometer  requires  a  large  grating  of  80pm  x  120pm.  Therefore,  the  reported 
measurements  are  an  average  over  the  entire  grating  array. 

A  wafer  was  constructed  to  evaluate  the  correlation  between  the  CD-SEM  and  the 
scatterometer.  Diffraction  gratings  were  printed  in  UV-110  resist  on  a  proprietary  front  end  of 
line  film  stack.  Gratings  were  printed  with  nominal  linewidths  of  150nm  through  500nm  at 
pitches  of  2x,  2.5x  and  3x.  Each  grating  was  printed  at  various  focus  and  dose  conditions  using  a 
Nikon  203b  stepper  system.  Scatterometery  measurements  were  made  with  the  Bio-Rad  CDS-2 
system  and  compared  to  measurements  taken  on  the  AMAT  7830Si  CD-SEM.  CDS-2  and 
7830Si  measurements  were  linearly  correlated  with  a  regression  yielding  a  slope  of  1.07,  y- 
intercept  of  -40.2nm  and  a  correlation  coefficient  of  0.986;  however,  the  poor  correlation  was 
dominated  by  measurements  taken  on  features  with  near  90°  sidewall  angles.  Figure  1  shows 
that  the  offset  between  the  CD-SEM  and  scatterometer  varies  with  sidewall  angle,  indicating  that 
the  two  techniques  interact  differently  with  materials  being  measured.  The  sidewall  angle 
dependence  is  linear  between  86°  and  90°  and  can  therefore  be  extracted  from  the  correlations. 
When  sidewall  angle  information  is  accounted  for  the  linear  regression  between  systems  is 
greatly  improved,  yielding  a  slope  of  1.00,  y-intercept  of  9  A,  and  a  correlation  coefficient  of 
0.998.  This  effect  is  discussed  relative  to  accuracy  of  critical  dimension  measurements. 


Figure  1.  Plot  of  the  difference  in  measured  critical  dimensions  between  the  CDS-2  and  7830Si  versus  CDS-2 
determined  sidewall  angle.  The  increase  in  magnitude  of  the  offset  with  sidewall  angle  indicates  that  the  two  systems 
interact  differently  with  near  vertical  sidewalls. 
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Polarized  Light  Scattering  Techniques  for  Surface  Wafer  Inspection 
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Gaithersburg,  Maryland  20899-8442 

Laser  light  scattering  is  used  in  semiconductor  manufacturing  to  inspect  product  and  process-witness  wafers  for 
particulate  contaminants,  defects,  and  roughness.  Typically,  a  laser  beam  is  focussed  onto  the  wafer  surface,  and 
one  or  more  large  optics  collect  light  scattered  from  the  surface.  As  the  laser  beam  is  scanned  over  the  wafer 
surface,  background  signal  is  associated  with  microroughness,  while  excursions  from  the  background  are  associated 
with  localized  scattering  sources,  such  as  particles  or  subsurface  defects.  The  use  of  multiple  detectors  allows  some 
discrimination  amongst  different  classes  of  light  scattering  events. 

The  polarization  of  scattered  light  carries  with  it  information,  which  can  substantially  improve  the  ability  of  these 
instruments  to  characterize  the  light  scattering  sources.  This  information  is  often  not  fully  utilized  by  these 
instruments.  While  the  incident  laser  beam  is  usually  polarized  in  a  manner  to  accentuate  a  specific  scattering 
source,  analysis  of  polarization  of  the  resulting  scattered  light  is  rarely  performed.  This  summary  will  outline 
results,  which  have  shown  how  the  polarization  can  be  used  to  identify  scattering  sources. 

Light  scattered  by  small  amounts  of  roughness  can  be  interpreted  using  first-order  vector  perturbation  theory.  1 
That  theory  predicts  that  the  polarization  is  independent  of  the  details  of  the  surface  height  function,  and  that  no 
depolarization  occurs.  That  is,  the  differential  Stokes  vector  power  d?s  scattered  per  unit  solid  angle  dQ.  is  given 
by1 

=  C°S  C0E|2  0.  S(q)QP,  (1) 

where  X  is  the  wavelength  of  the  probing  light,  0t  is  the  incident  polar  angle,  Bs  is  the  scattered  polar  angle,  5(q) 
is  the  power  spectral  density  function  of  the  surface  height  function,  evaluated  at  surface  wavevector  q,  Q  is  a 
Mueller  matrix,  and  Pf  is  the  Stokes  vector  incident  power.  The  Mueller  matrix  Q  depends  only  upon  the  incident 
and  scattering  directions,  the  polarization  of  the  incident  light,  and  the  optical  constants  of  the  material,  and  is  non¬ 
depolarizing.  The  fact  that  roughness  in  the  smooth  surface  limit  does  not  depolarize  light  enables  any  scattering 
instrument  to  be  insensitive  to  roughness  by  placing  an  appropriately  aligned  polarizer  before  each  detector.  2  While 
the  signal  from  a  rough  surface  is  virtually  eliminated  by  the  presence  of  the  polarizers,  that  from  other  sources,  be  it 
from  particulate  contaminants  or  subsurface  defects,  is  reduced  by  a  much  lower  amount.  Calculations  of  integrated 
scatter  signals,  comparing  unpolarized  detection  with  polarized  roughness-blind  detection,  have  yielded  reductions 
of  the  minimum  detectable  diameter  for  small  particles  or  subsurface  defects  of  approximately  a  factor  of  two.  3 

Rough  surface  scatter  not  only  contributes  a  background  signal  when  no  localized  defects  are  present,  but  it  also 
interferes  with  signals  in  the  presence  of  defects.  In  any  specific  scattering  direction,  the  signal  when  the  laser  beam 
is  illuminating  a  particle  is  a  result  of  the  coherent  sum  of  the  fields  from  the  particle  itself  and  that  from  the  residual 
roughness.  Since  the  signals  are  added  as  fields,  not  intensities,  a  background  signal  whose  intensity  is  only  1  %  of 
the  signal  from  the  particle  will  contribute  as  much  as  10  %  to  uncertainty  of  the  net  signal.  With  the  polarized 
roughness-blind  configuration,  however,  the  signal  measured  by  the  instrument  only  includes  contributions  to  the 
scatter  from  the  particle,  thereby  reducing  or  eliminating  the  coherence  noise  caused  by  the  roughness.  This 
behavior  substantially  reduces  uncertainties  in  measurements  of  scatter  from  standard  diameter  polystyrene  latex 
spheres,  as  well  as  improves  the  characterization  of  unknown  defects. 

Many  surfaces  requiring  inspection  have  dielectric  layers  deposited  upon  them.  The  scatter  from  such  surfaces 
can  be  described  as  the  sum  of  the  scattered  fields  from  the  roughness  of  each  interface.  4  Whether  these  materials 
depolarize  the  light  or  not  is  related  to  the  degree  of  correlation  between  the  roughness  fimctions.  When  the  layers 
are  thin,  their  interfaces  tend  to  be  conformal,  and  the  scattering  is  non-depolarizing.  When  this  is  the  case,  the 
method  described  above  can  successfully  remove  the  entire  roughness-induced  signal.  For  thick  layers,  the 
roughness  of  each  interface  may  not  be  conformal.  This  lack  of  interfacial  conformity  may  yield  some 
depolarization,  limiting  the  ability  for  roughness-blind  instrumentation  to  be  fully  roughness-blind.  Any  specific 


0-7803-6252-7/00/$1 0.00©2000  IEEE 


41 


interface,  however,  will  contribute  a  polarized  signal,  so  that  roughness  from  a  single  interface  can  always  be 
removed.  A  roughness-blind  configuration  can  always  be  attained  for  a  specific  interface  in  a  dielectric  stack. 

Eq.  (1)  has  been  used  extensively  to  extract  roughness  functions  5(q)  from  angle-resolved  light  scattering 
measurement  results.  Interpreting  such  measurements  in  the  presence  of  dielectric  layers  is  difficult,  requiring 
assumptions  about  the  relative  roughness  levels  and  degree  of  correlation.  Since  the  scatter  polarization  from  two 
rough  interfaces  depends  upon  the  r  oughness  of  each  interface  and  the  degree  of  correlation  between  the  two 
interfaces,  angle-resolved  polarized  light  scattering  can  be  used  to  characterize  the  roughness  of  two  interfaces. 

Such  measurements  have  been  demonstrated  with  SiO  2  layers  thermally  grown  on  rough  silicon  and  show  the 
growth  interface  smoothening  as  the  layer  thickness  is  increased.  5 

Measurements  of  the  polarization  of  light  scattered  by  particulate  contaminants  can  provide  information  about  the 
size  of  those  contaminants.  Results  using  calibrated  polystyrene  latex  spheres  on  silicon  surfaces,  6  and  on  silicon 
surfaces  with  dielectric  layers,  7  have  shown  that  the  polarization  of  the  scattered  light  is  a  strong  function  of  the  size 
of  the  contaminants.  While  size  of  particulate  contaminants  is  typically  obtained  by  interpreting  the  intensity  of  the 
scattered  light,  the  results  can  only  be  correlated  to  that  scattered  by  calibration  particles,  which  are  typically 
polystyrene  latex  spheres.  The  results  of  theoretical  calculations  demonstrate  that  the  factor  that  dominates  the 
polarization  of  light  scattered  by  small  particulates  is  the  mean  distance  the  particle  lies  from  the  surface.  6,7 
Application  of  these  findings  should  substantially  improve  the  ability  for  light  scattering  instruments  to  characterize 
real-world  particulate  contaminants. 
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In-Situ  Spectral  Ellipsometry  Monitoring/Control  of  MBE  Growth 

K.  G.  Evink.  W.  T.  Taferner ,  K.  Mahalingam,  D.  L.  Dorsey,  S.  Adams,  Sensors  and 
Survivability  Directorate  AFRL/MLPO  Wright-Patterson  AFB,  OH  45433 


Spectroscopic  Ellipsometry  (SE)  has  received  considerable  attention  for  semiconductor 
growth  monitoring  and  control  due  to  its  ability  to  determine  the  thickness  and  composition  of 
thin  parallel  layers.  In  this  work,  we  report  on  the  use  of  spectroscopic  ellipsometry  for 
monitoring  and  control  of  AlGaAs  growth  in  molecular  beam  epitaxy.  Knowledge  of  the  optical 
constants  as  a  function  of  composition  and  growth  parameters  is  essential  for  modeling  to  be  able 
to  extract  the  composition  and  growth  rate  from  measured  SE  data.  For  this  work,  we  have  used 
the  composition  and  temperature 
dependent  optical  database  of  AlxGai_ 
xAs  developed  by  G.  Maracas1.  We 
have  used  the  virtual  interface 
approximation  to  extract  the  growth  rate 
and  composition  of  of  AlxGai.xAs  films 
grown  at  normal  and  elevated 
temperatures.  At  elevated  temperatures, 

Ga  possessed  a  sticking  coefficient  less 
than  1 .  Figure  1  is  a  plot  of  the  percent 
Ga  desorption  measured  by  SE  and 
desorption  mass  spectrometry.  Below 
900K  Ga  was  observed  to  grow  at  a 
constant  rate.  At  elevated  temperatures 
the  Ga  growth  rate  decreased  until  at 
940K  no  continued  growth  was  observed, 
observed  growth  rates. 

Our  control  software  was  modified  to  use  real-time  SE  data  to  target  pre-determined 
growth  rates  for  A1  and  Ga  by  adjusting  the  cell  temperatures.  As  part  of  this  effort  a  Guide 
Evolutionary  Stimulated 
Annealing  algorithm  has  been 
developed  for  real-time 
optimization  of  a  growth 
model2.  Figure  2  is  a  plot  of  an 
extracted  A1  composition  using 
the  GESA  algorithm  from  an 
AlxGai.xAs  growth  in  which  the 
target  composition  was  changed 
from  0.25  to  0.35.  After  an 
initial  settling,  the  composition 
tracked  with  a  precision  of 
better  than  0.005  in  the  standard 
deviation.  For  our  current 
applications,  compositional 


Temperature  (K) 

Figure  1 .  The  percentage  of  Ga  desorption  as  a 
function  of  growth  temperature 

Under  all  conditions  SE  accurately  predicted  the 
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Figure  2.  A  pot  of  the  extracted  composition  during  a  set  point  change 
from  0.25  to  0.35  was  given  to  the  control  system.  The  associated  Ga 
cell  temperature  is  also  given. 


control  of  1%  was  sufficient.  This  technique  and  the  standard  Marquardt-Levin  (M-L)  approach 
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gave  similar  precision  during  tracking.  A  small  difference  in  the  absolute  number  was  observed 
which  was  traced  to  the  smaller  range  of  wavelength  used  in  the  GESA  than  the  M-L  approach. 
It  was  also  observed  that  the  GESA 
algorithm  was  less  likely  to  get  stuck 
at  a  local  minimum  that  the  M-L 
method. 

Using  SE  to  accurately 
measure  in-situ  growth  rates  and 
compositions,  a  timing  sequence  for 
AlGaAs/GaAs  could  be  made.  A 
Be-doped  quantum  well  infrared 
photodetectors  structures  was  made 
with  the  SE  precalibration.  These 
QWIPS  achieved  the  target 
absorption  edge  of  ~9p  as  measured 
by  photoconductivity3.  This 
indicated  that  in-situ  pre-calibration 
using  SE  was  an  effective  approach 
for  growth  control.  The  ability  of 
in-situ  pre-calibration  to  grow 
compositonally  graded  structures 
was  also  tested.  We  grew  a  parabolic 
and  trapezoidal  AlxGai.xAs  structure 
using  the  dependence  of  the 
composition  and  growth  rate  on  the 
A1  and  Ga  effusion  cell 
temperatures.  The  effusion  cell 
growth  rate  dependence  was 
measured  using  the  in-situ  SE.  trapezoidal  structures  plotted  with  the  target  profile  (-).  Note 

Figure  3  shows  the  calculated  these  curves  are  inverted  from  those  in  f,Surc  3- 

temperature  profile  needed  to 

achieve  the  parabolic  and  trapezoidal  structures.  The  associated  SE  data  shows  that  the  in-situ  x 
composition  approximately  followed  the  expected  profile.  We  currently  believe  the  overall  poor 
SE  monitoring  is  due  to  constant  composition  of  the  film  used  in  the  virtual  interface  model.  We 
are  currently  exploring  this  assumption.  Figure  4  shows  the  SIMS  analysis  of  these  profiles.  As 
can  be  seen  the  structures  approximately  fit  the  compositional  profile  for  the  parabolic  structure. 
A  lag  in  the  cell  response  appears  more  evident  in  the  latter  trapezoidal  profile.  From  this 
analysis  it  is  not  certain  if  pre-calibration  is  adequate  for  graded  compositions  and  suggests  that 
real-time  feedback  maybe  required  for  grading. 


'  G.  Maracas  et  al  JVST  B12  (1994)  pl214. 

2  Z.  Meng  et  al,  Computational  Intelligence  Procedures  for  Monitoring  and  Control  of  MBE  Growth  of Arbitrary 
Film  Structures,  AIRTC  98  Conference  Proceedings.  October  5-8  (1998). 

3  Submitted  to  at  ICMCTF  2000  in  San  Diego 


Figure  3.  (Top)  Temperature  profile  used  to  make  a 
trapezoidal  and  parabolic  structure.  (Bottom)  Composition, 
xAI  extracted  in  real-time. 


Fgure  4.  The  SIMS(o)  depth  analysis  of  the  parabolic  and 
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Real-Time  Control  of  Substrate  Temperature  Using  Band  Edge  Thermometry 


S.  R.  Johnson,  M.  Beaudoin,  M.  Boonzaayer,  E.  Grassi,  and  Y.  H.  Zhang 
Center  for  Solid  State  Electronics  Research,  Department  of  Electrical  Engineering, 
Arizona  State  University,  Tempe,  AZ  85287-6206 
Tel:  602-965-2565,  Fax:  602-965-8118,  E-mail:  shane.johnson@asu.edu 


In  epitaxial  growth,  the  sensitivity  of 
material  properties  to  growth  temperature  means 
that  an  accurate  and  reliable  method  for 
measuring  substrate  temperature  is  essential. 
Recently,  optical  methods  have  been  developed 
that  overcome  some  of  the  limitations  of 
thermocouples  and  pyrometers  [1-3].  Pyrometry 
for  example,  does  not  work  at  low  temperatures, 
is  affected  by  surface  roughness,  window 
coating,  and  substrate  emissivity  changes  during 
growth.  Since  both  rotating  and  radiatively 
heated  substrates  are  preferred  during  epitaxial 
growth,  physical  contact  between  the  substrate 
and  the  temperature  sensor  is  not  practical. 
Therefore,  thermocouples  are  radiatively 
coupled  to  the  substrate  and  consequently  have  a 
large  temperature  offset  compared  to  the 
substrate.  Furthermore  this  offset  depends  on 
substrate  and  heater  temperature,  substrate  back 
surface  texture,  doping  level,  and  doping  type, 
and  on  the  changing  absorptance  and  emittance 
of  the  epilayer  during  growth. 


Fig.  1 .  The  temperature  dependence  of  the  onset  of 
transparency  of  a  350  pm  thick  InP:Fe  substrate. 

In  these  new  optical  methods,  substrate 
temperature  is  inferred  from  the  onset  of 
transparency  of  the  substrate  its  self.  The  band 
edge  of  HI-V  semiconductors  cuts  off 
exponentially  and  broadens  and  shifts  to  lower 
energies  with  temperature  [4].  This  band  edge 
shift  (or  bandgap  shrinkage)  is  linear  in 


temperature  above  the  Debye  temperature  (see 
Fig.  1).  In  practice,  substrate  temperature  is 
inferred  from  the  spectral  position  of  various 
points  in  the  diffuse  reflectance  spectrum  [1]  or 
transmission  spectrum  [2,3].  The  onset  of 
transparency  of  a  350  pm  thick  InP:Fe  substrate 
is  shown  in  Fig.  2.  Here,  the  onset  of 
transparency  (or  position  of  the  spectrum)  is 
defined  as  the  intersection  of  the  background 
signal  with  a  linear  extrapolation  through  the 
steepest  part  of  the  spectrum. 


Fig.  2.  The  position  of  the  spectrum  and  hence 
substrate  temperature  is  given  by  the  intersection  of 
the  asymptotes  of  the  knee  region  of  the  spectrum. 
The  knee  region  is  shown  by  the  solid  dots. 

The  nested  loop  approach  shown  in  Fig.  3 
is  a  convenient  way  to  add  an  accurate  external 
temperature  sensor  to  the  conventional 
Eurotherm-thermocouple  control  loop  found  in 
molecular  beam  epitaxy  systems.  In  this  case, 
the  outer  control  loop  updates  the  Eurotherm 
(thermocouple)  setpoint  based  on  the  difference 
between  the  user  setpoint  and  the  value 
measured  by  the  external  sensor,  using  a 
proportional-integral-derivative  (PID)  algorithm 
implemented  in  control  software.  This  method 
has  the  advantage  that  the  Eurotherm- 
thermocouple  control  system  is  left  intact,  so 
that,  in  the  event  the  external  sensor  or  control 
computer  fails,  the  heater  power  will  continue  to 
be  controlled. 
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Fig.  3.  Schematic  of  the  nested  proportional-integral- 
derivative  (PID)  control  loop  used  to  control 
substrate  temperature. 

The  Eurotherm-thermocouple  (or  inner) 
loop  primarily  controls  the  heating  element  and 
is  limited  in  its  ability  to  detect  and  control  true 
temperature  of  the  substrate.  This  inability  also 
means  that  the  Eurotherm-thermocouple  control 
loop  only  partially  rejects  external  disturbances, 
such  as  the  thermal-shutter  transients  that  occur 
at  the  substrate  when  effusion  cell  shutters  are 
toggled.  In  this  control  scheme,  the  outer  loop 
periodically  updates  the  thermocouple  setpoint, 
causing  the  Eurotherm  control  loop  to  bring 
substrate  temperature  back  in  line  with  its  target 
value. 

During  the  growth  of  absorbing  overlayers, 
radiatively  heated  substrates  are  known  to  heat 
up  dramatically  as  the  layer  becomes  thicker  [5]. 
This  occurs  under  either  constant  heater  power 
or  constant  thermocouple  temperature.  Under 
these  growth  conditions,  it  is  imperative  that  one 
has  an  alternative  non-contact  method  of 
determining  substrate  temperature,  such  as, 
diffuse  reflectance  spectroscopy  (DRS). 

Thermocouple  temperature,  substrate 
(DRS)  temperature,  and  heater  voltage  recorded 
during  the  growth  of  lattice-matched  InGaAs  on 
InP  are  shown  in  Fig.  4.  The  supervisory 
computer  samples  the  DRS  temperature  every 
30  s,  if  the  DRS  temperature  has  deviated  more 
than  ±  1  °C  from  the  substrate  setpoint  of 
435  °C,  the  outer  PID  loop  updates  the 
Eurotherm  (inner)  setpoint.  Throughout  most  of 
the  growth  of  the  InGaAs  layer,  the  substrate 
temperature  is  about  1  °C  above  its  setpoint. 
Decreasing  the  update  time  would  bring  the 
substrate  temperature  closer  to  the  setpoint 
during  the  first  part  of  the  growth.  As  can  be 
seen  in  Fig.  4,  each  time  the  thermocouple 
setpoint  is  updated,  there  is  a  sharp  drop  in  the 
heater  voltage  and  the  thermocouple 
temperature  quickly  moves  to  its  new  setpoint. 
The  thermal  response  of  the  thermocouple- 
heater  system  is  fast  (4  s  time  constant) 
compared  to  the  thermal  response  of  the 
substrate  (60  s  time  constant).  This  allows 
appropriate  control  action. 

The  thermocouple  is  insensitive  to  the 
substrate  temperature  increase  caused  by  the 


strong  absorption  of  heater  radiation  in  the 
overlayer.  The  small-bandgap  epilayer  absorbs 
a  larger  part  of  the  heater  radiation  spectrum 
than  the  InP  substrate,  which  is  transparent  to 
most  of  the  blackbody  spectrum  at  these 
temperatures.  Therefore,  as  the  InGaAs  layer 
thickens,  the  transmission  losses  of  heater 
radiation  are  reduced.  In  order  to  maintain 
constant  substrate  temperature,  the  control 
system  continually  reduces  the  heater  power  by 
decreasing  the  Eurotherm  (thermocouple) 
setpoint  over  the  course  of  the  growth. 
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Fig.  4.  Real-time  control  of  substrate  temperature 
during  the  growth  of  lattice-matched  InGaAs  on  semi- 
insulating  InP,  using  diffuse  reflectance  spectroscopy 
(DRS).  The  substrate  setpoint  is  435  °C. 


The  small  oscillations  in  the  substrate 
temperature  at  the  start  and  stop  points  are 
caused  by  the  abrupt  changes  in  the  thermal  load 
on  the  substrate  when  the  In  and  Ga  cells  are 
togged.  At  the  start  of  growth  the  thermocouple 
setpoint  is  reduced  by  6  °C  in  an  effort  to 
correct  for  the  thermal  disturbance  caused  by 
additional  radiant  heat  load  of  the  In  and  Ga 
effusion  cells.  This  is  the  simplest  way  of 
rejecting  thermal  disturbances.  At  the  end  of 
the  growth,  no  shutter  disturbance  rejection  is 
used  and  the  substrate  momentarily  drops  4  °C 
below  the  setpoint. 

[1]  S.  R.  Johnson,  T.  Tiedje,  J.  Crystal  Growth, 
175/176,  38(1997). 

[2]  J.  A.  Roth,  T.  J.  DeLyon,  M.  E.  Adel,  Mat. 
Res.  Soc.  Symp.  Proc.  324,  353  (1994). 

[3]  E.  S.  Heilman,  J.  S.  Harris,  J.  Crystal 
Growth,  81,  38(1987). 

[4]  S.  R.  Johnson,  T.  Tiedje,  J.  Appl.  Phys.  78, 
5609(1995). 

[5]  B.  V.  Shanabrook,  J.  R.  Waterman,  J.  L. 
Davis,  R.  J.  Wagner,  Appl.  Phys.  Lett.,  61, 
2338(1992). 
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Substrate  Temperature  Measurement  and  control  by  Emissivity  Compensated  Pyrometry  during 
Metalorganic  Vapor  Phase  Epitaxy  of  III-V  Device  Structures  in  Large  Scale  Rotating  Disc 

Reactors 

J.C.  Ramer,  B.  Patel,  V.  Boguslavskiy,  A.  Patel,  M.  Schurman,  A.  Gurary 
EMCORE  Corporation,  394  Elizabeth  Ave.,  Somerset,  NJ  08873 


The  growth  process  for  many  III-V  semiconductor  devices  is  quite  temperature  sensitive.  In  fact,  the 
temperature  window  for  some  of  the  more  challenging  device  structures,  such  as  InGaAsP/InP  infrared  laser  diodes, 
and  InGaP/GaAs  heterojunction  bipolar  transistors,  can  be  as  narrow  as  2-3  °C,  depending  upon  how  tight  the  final 
device  specifications  are.  Clearly,  measurement  and  control  of  the  growth  temperature  in  a  MOVPE  production 
environment  is  critical  to  maintaining  high  yields  from  the  growth  process. 

We  have  solved  this  problem  by  using  a  pyrometric  technique  known  as  Emissivity  Compensated 
Pyrometry  (ECP)  [1].  Simply  stated,  this  in-situ  measurement  technique  requires  the  combined  functionality  of  a 
conventional  pyrometer  and  a  reflectometer.  The  reflectance  of  the  substrate  is  measured  at  the  same  wavelength 
that  the  pyrometer  measures  the  thermal  radiance.  By  using  the  equation 

e  =  1-R  (1) 

the  spectral  directional  emissivity  of  the  substrate  surface  can  be  calculated  from  the  measured  spectral  directional 
reflectivity.  This  allows  the  measured  thermal  emission  to  be  constantly  corrected  for  the  changing  emissivity  of  the 
growth  surface.  In  this  way,  the  accurate  temperature  of  the  substrate  can  be  measured  at  any  time  during  growth. 

The  ECP  sends  measured  emissivity  and  temperature  values  to  a  Windows  NT  based  PC  running  a 
proprietary  sorting  and  averaging  algorithm.  Emissivity  and  temperature  values  are  sent  to  the  computer  every  25 
ms.  During  growth,  die  wafer  carrier  is  spinning  at  800  RPM,  meaning  that  measurements  will  be  made  both  on  the 
substrates,  and  on  the  wafer  carrier  surface.  Since  the  reflectometer  is  calibrated  to  perform  measurements  on 
specular  substrate  surfaces,  all  measurements  on  the  diffuse  wafer  carrier  surface  result  in  100%  emissivity 
measurements,  so  sorting  and  discarding  the  emissivity/temperature  measurements  made  on  the  wafer  carrier  surface 
is  quite  straightforward.  After  removing  measurements  made  of  the  wafer  carrier  surface,  the  system  displays  the 
temperature  and  emissivity  of  the  substrates  constantly  throughout  the  growth  run.  The  measurement  is  not 
synchronized  with  the  rotation  of  the  wafer  carrier,  so  the  measured  temperature  (and  the  emissivity)  is  an  average 
temperature  of  all  the  substrates  on  the  wafer  carrier.  The  directional  spectral  emissivity  measurement  is  calibrated 
by  loading  calibrated  undoped  Si  substrates  onto  the  wafer  carrier,  and  setting  the  emissivity  measurement  to  the 
known  spectral  directional  emissivity  of  undoped  Si  at  905  nm  (e  =  68%)  [2],  All  emissivity  measurements  made  by 
the  system  are  indexed  to  this  reference  point. 

The  growth  experiments  described  in  this  work  were  performed  on  an  EMCORE  Discovery  180  production 
MOCVD  reactor  [3].  A  6x2”  wafer  configuration  was  used  for  most  of  the  experiments,  although  much  data  was 
taken  using  3x3”,  and  1x6”  wafer  configurations  as  well.  It  is  important  to  note  that  much  commercial  laser  diode 
production  (A^Ga^As,  InxGai_xAsi_yPy,  and  InxGai.xP)  is  currently  done  on  reactors  of  this  size.  The  reactor  is  a 
vertical  flow  reactor,  with  3  normal  incidence  viewports  mounted  on  the  top  injection  flange.  The  viewports  are 
approximately  12  inches  from  the  growth  surface. 

An  example  of  the  emissivity  and  temperature  measurement  of  the  growth  surface  dining  a  deposition  run 
is  shown  in  the  data  in  Figure  1.  The  structure  grown  is  a  simple  InP/InGaAsP/InP  doubleheterostructure.  The 
quaternary  material  is  lattice  matched  to  InP  (  Aa/a  =  -500  ppm)  and  the  PL  wavelength  of  the  structure  is 
approximately  1310  nm.  Since  the  As:P  ratio  in  the  solid  material  is  very  sensitive  to  temperature  (  because  the 
pyrolysis  efficiencies  of  PH3  and  AsH3  are  very  different  at  common  growth  temperatures),  this  kind  of  structure  is 
an  excellent  epitaxial  metric  for  in-situ  temperature  measurement. 
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Figure  1:  Substrate  temperature  and  emissivity  data  for  a  InP/InGaAsP/lnP  lattice  match  calibration  structure .  The  lower 
curve  is  the  emissivity.  Note  that  the  entire  growth  takes  place  with  fixed  thermocouple  setpoints. 

In  order  to  determine  the  precision  of  the  ECP  temperature  measurement,  two  growth  runs  were  done  on 
different  substrates  using  the  same  thermocouple  setpoints.  As  seen  in  the  data  in  Table  I,  a  30%  increase  in  the 
doping  level  of  the  substrate,  in  this  case  Sulfur  doped  InP,  results  in  a  substrate  temperature  increase  of  2.5  °C, 
which  results  in  a  ~7  nm  PL  wavelength  shift.  With  the  in-si tu  temperature  measurement,  the  cause  of  the  change  in 
PL  wavelength  can  be  attributed  to  increased  free  carrier  absorption  in  the  substrate  with  the  higher  doping  level. 


substrate 

doping  level 

(cm'3) 

substrate  temperature  (°C) 

InGaAsP  PL  X  (nm) 

lnP:S 

5.4e18 

664.5 

1338.3 

lnP:S 

4.15e18 

662.0 

1345.0 

Table  /.:  Sensitivity  of  substrate  temperature  to  doping  level  in  lnP:S  substrates.  Identical  process  conditions  were  used  for  both 
deposition  runs;  the  only  difference  is  the  substrate  doping  level.  (Note:  higher  substrate  temperature  results  in  lower  As:P  ratio  in 

the  solid,  which  causes  the  lower  PL  wavelength.) 

The  precision  and  accuracy  of  this  temperature  measurement  technique  is  sufficient  to  ensure  that  the 
surface  temperature  of  the  substrate  can  be  held  within  a  window  of  +/-1  °C  for  many  months  of  continuous 
operation.  Results  from  temperature  sensitive  epitaxial  metrics  such  as  Zn  doped  InGaAlP  and  compositionally 
sensitive  InGaAsP  structures  show  that  the  temperature  of  the  substrate  measured  by  ECP  correlates  with  post 
growth  characterization  measurements  of  the  temperature  sensitive  properties  of  these  materials. 

It  is  also  worth  mentioning  that  since  the  ECP  system  measures  the  normalized  reflectance  of  the  wafer 
surface  during  growth,  the  data  generated  also  allows  the  calculation  of  the  growth  temperature  optical  constants  (  n 
and  k)  and  the  growth  rate  of  thick  layers.  Using  the  virtual  interface  method  of  Breiland  and  Killeen  [4]  to  fit  the 
reflectance  data,  we  estimate  that  the  growth  rate  can  be  determined  with  an  error  of  <  1%,  for  layers  thicker  than 
X/n. 

The  ECP  system  has  been  applied  to  a  wide  range  of  reactor  sizes,  from  75  mm  disc  to  400  mm  disc,  with 
similar  results.  The  same  hardware  and  software  configuration  can  be  used  on  any  reactor  size,  regardless  of  wafer 
size  or  arrangement  on  the  wafer  carrier,  since  there  is  no  synchronization  required  with  the  disc  rotation 
mechanism.  This  temperature  measurement  system  will  allow  users  of  vertical  rotating  disc  reactors  to  achieve 
much  tighter  control  of  temperature  sensitive  processes. 
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1. 1.P  Herman,  in  Optical  Diagnostics  for  Thin  Film  Processing,  (Academic  Press,  1995)  p.  609-614. 

2.  P.J.  Timans,  The  Thermal  Radiative  Properties  of  Semiconductors,  Advances  in  Rapid  Thermal  and  Integrated 
Processing,  (Kluwer  Academic  Publishers,  1996),  p.  35-101. 

3.  A.G.  Thompson,  Materials  Letters  30,  255  (1997). 

4.  W.G.  Breiland  and  K.P.  Killeen,  J.  Appl.  Phys.  78(11),  6726  (1995). 
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WD1.1  (Invited) 
l:30pm-2:00pm 

Lasers  and  Space  Optical  Systems  (LASSOS)  Study: 

Application  and  Enabling  Technologies 

C.  Giuliano,  R.  Berdine,  R.  Benedict,  M.  Hallada, 

D.  Johnston,  and  J.  Piotrowski 

Air  Force  Research  Laboratory/Directed  Energy, 

Schafer  Corporation, 

United  States  Air  Force,  Retired 

The  Laser  and  Space  Optical  Systems  (LASSOS)  study  gathered  inputs  from  military 
users  and  technologists  to  identify  over  a  hundred  promising  weapon  and  non-weapon 
applications  for  lasers  and  optical  systems  which  make  use  of  the  space  environment  - 
either  based  in  space  or  transiting  space.  Key  enabling  technologies  for  these 
applications,  as  well  as  roadmaps  to  develop  them,  as  needed,  were  also  identified. 
Thirty-six  high  interest  concepts  were  selected  during  Phase  I  of  the  study  and  nine  were 
examined  in  more  detail  in  Phase  II.  Potential  multi-mission  implementations  were 
investigated  and  potential  acquisition  strategies  and  demonstrations  were  suggested.  A 
more  detailed  investigation  of  the  multi-mission  utility  of  relay  mirrors  was 
recommended.  Study  results  were  briefed  to  the  Air  Force  Major  Commands,  from 
which  requirements  advocacy  was  sought.  Expanded  interactions  between  warfighters 
and  technologists,  as  experienced  during  the  study,  were  encouraged. 
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WD2.1  (Invited) 
2:00pm-2:30pm 


LASER  SPACE  COMMUNICATIONS 
Ronald  L.  Phillips 
Florida  Space  Institure 
(407)  730-2601 
FAX:  (407)  658-5595 
E-mail:  fsiccas@mail.ucf.edu 


Compared  with  conventional  RF  systems,  there  are  several  significant  advantages  offered 
by  laser  communication  (lasercom)  systems  in  space  applications  that  are  simple  consequences  of 
the  short  wavelengths  (high  frequencies)  associated  with  optical  waves.  Among  these  advantages 
are  (/)  smaller  antenna  (telescope),  (//)  smaller  size  and  weight  of  the  components,  ( iii )  power 
concentration  in  a  very  narrow  beam,  and  (fv)  the  potential  increase  in  modulation  bandwidth. 

The  last  advantage  is  particularly  important  because  the  amount  of  information  transmitted  by  a 
communication  system  is  directly  related  to  the  bandwidth  of  the  modulated  carrier,  which  is 
usually  some  fraction  of  the  carrier  frequency  itself.  Hence,  increasing  the  carrier  frequency  has 
the  potential  to  increase  the  information  capacity  of  the  system  (higher  data  rates).  Applications 
that  could  benefit  from  lasercom  are  those  that  have  platforms  with  limited  weight  and  space, 
require  very  high  data  links,  and/or  involve  deep  space  return  links. 

Although  lasercom  systems  offer  certain  advantages  over  RF  systems  as  described  above, 
there  are  some  drawbacks  that  arise  from  the  smaller  wavelengths  used  in  lasercom  systems.  For 
example,  the  high  directivity  of  the  transmitted  beam  makes  acquisition  and  pointing  more 
difficult.  Also,  optical  component  design  requires  its  own  technology  separate  from  design 
techniques  associated  with  RF  systems.  Thus,  performance  characteristics  can  be  significantly 
different  than  those  associated  with  RF  systems.  Optical  turbulence  resulting  from  small 
temperature  variations  in  the  atmosphere  gives  rise  to  power  losses  from  spreading  of  the  beam 
spot  radius  beyond  that  due  to  diffraction  alone,  and  to  temporal  and  spatial  fluctuations  of  the 
laser  beam  known  as  scintillation.  Small  pointing  errors  can  easily  lead  to  unacceptable  fade 
levels  owing  to  a  Gaussian  roll-off  in  the  mean  irradiance  profile  combined  with  large  off-axis 
scintillation. 
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WD2.3  FREE  SPACE  SATELLITE  LASER  COMMUNICATION 

3:30pm-4:00pm 

Adnan  Cora-Hacer  Atar 
Karadeniz  Technical  University 
Department  of  Electrical  and  Electronics  Engineering 
61080-TrabzonfTURKEY 
e-mail:  cora@ktu.edu.tr 


In  optical  communication  systems  an  electrical  signal  is  converted  to  a  modulated  light 
wave,  which  then  transmits  the  signal  to  a  receiver.  The  light  signal  is  converted  back  to  an 
electrical  signal  at  the  receiver.  Modulation  of  the  wave  is  necessary  in  order  to  convey 
information.  To  transmit  information  with  light  we  have  to  turn  it  on  and  off,  or  modulate  it  in 
some  other  way,  to  have  it  represent  some  sort  of  coded  message  that  is  recognizable  at 
the  receiver.  This  great  potential  of  light  waves  for  communication  purposes  has  been 
recognized  for  a  long  time,  but  became  practical  only  after  the  development  of  lasers  [1], 

On  of  its  applications  is  satellite  Laser  Communications.  Free  spare  optical 
communication  between  satellites  networked  together  can  make  possible  high  speed 
communication  between  different  places  on  the  Earth.  The  advantages  of  an  optical 
communication  system  instead  of  a  microwave  communication  system  in  free  space  are  1) 
smaller  size  and  weight,  2)  less  transmitter  power,  3)  large  bandwidth,  4)  higher  immunity 
to  interference.  The  main  complexity  of  satellite  optical  communication  is  the  pointing 
system  satellites  vibrate  continually  because  of  their  subsystem  operation  and 
environmental  sources.  Due  to  vibrations  to  receiver  receives  less  power.  This  effect  limits 
the  system  bandwidth  for  given  Bit  Error  Rate  (BER). 

Arnon,  Rotman  and  Kopeika  [2]  derived  an  algorithm  to  maximize  the 
communication  system  bandwidth  using  the  transmitter  telescope  gain  as  a  free  variable 
based  on  the  vibration  statistics  model  and  the  system  parameters.  Their  model  makes  it 
possible  to  adapt  the  bandwidth  and  transmitter  gain  to  change  of  vibration  amplitude. 

BANDWIDTH  MAXIMIZATION 


The  receiver  is  assumed  to  include  an  optical  detector  in  direct  detection  mode  with 
modulation  format  of  On/Off  Keying.  In  sush  systems  the  BER  for  an  optimal  threshold 
receiver  is  given  by 


BER 


fl 


V 


p-o(Q) 

V2(a1(0)  +  (CTO(0) 


JJ 


F(0)d0 


(i) 


where,  pi(0)  and  oi  (6)  are  the  received  optical  signal  and  the  receiver  noise  standard 
deviation  for  receiver  “1”,  respectively,  and  p0  (9)  and  cr0  (0)  are  the  received  optical  signal 
and  the  receiver  noise  standard  deviation  for  receiving  "0”,  respectively  . 

TRANSMITTER  AND  RECEIVER 

In  this  section  the  basic  elements  of  the  transmitter  and  the  receiver  defined.  Figs. 
1  and  2  describe  a  simple  scheme  of  transmitter  and  receiver.  The  receiver  (Fig.  1)  model 
includes:  telescope,  optical  band-pass  filter,  input  insertion  losses,  optical  amplifier,  output 
insertion  losses,  optical  band-pass  filter,  p-i-n  photo-diode,  electrical  filter,  and  a  decision 
rule  circuit.  The  transmitter  (Fig.  2)  model  includes:  laser  transmitter,  optical  phased-array 
telescope,  control  unit,  and  random  attenuation  (vibration  effects)  [3], 
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Fig.  2.  Transmitter  model 


The  optical  power  received  by  the 
receiver  satellite  is 


Pr=  Ki  Gt  (2) 

Where,  GT  is  the  transmitter  telescope 
gain  and  the  pointing  factor  is 

L=  exp  (Gt  92)  (3) 

Where  0  is  the  radial  pointing  error  angle. 
The  receiver  gain  is 


Fr= 


(  7tDR^ 


.  A.  J 

Where  DR  is 


diameter. 


the  receiver  aperture 


PRACTICAL  EXAMPLE 

The  scenario  considered  here  is  communication  between  two  LEO  (low  earth  orbit) 
satellites.  The  satellites  are  placed  at  an  orbit  altitude  of  about  1400  km.  The  distance 
between  satellites  is  assumed  to  be  4500  km.  The  important  of  the  communication 
systems  are  the  transmitter  power  is  1  W,  the  optical  wavelength  is  0,8  pm  and  the 
required  BER  is  10‘9.  This  analysis  points  out  bandwidth  maximization  for  satellite  laser 
communication  [4],  The  implementation  of  this  model  is  simple.  From  the  results  of  the 
example  it  is  easy  to  understand  the  importance  of  optimization  for  system  bandwidth.  This 
analysis  also  points  out  that  even  low  values  of  vibration  amplitudes  decrease  dramatically 
the  possible  system  bandwidth. 

References 

1.  R.  K.  Tyson,  “Adaptive  Optics  and  Ground  to  Space  Laser  Communications”,  Applied 
Optics.  35,  3640-3646  (1996). 

2.  E.  V.  Hosersten,  Optical  Communication  Theory,  Chap.  8,  pp.  1805-1863. 

3.  S.  Arnon,  S.  R.  Rotman,  and  N.  S.  Kopeika,  "Bandwidth  Maximization  for  Satellite 
Laser  Communication”,  IEEE  Trans.  Commun.  35,  675-682  (1999). 

4.  S.  Arnon,  “Vie  of  Satellite  natural  vibrations  to  Improve  Performance  of  Free-Space 
Satellite  Laser  Communication”,  Applied  Optics.  37,  5031-5036  (1998). 
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WD2.4  (Invited) 
4:00pm-4:30pm 


Photonics  for  Agile-Beam  Wideband  Antenna  Systems* 

L.  M.  Johnson,  G.  E.  Betts,  P.  A.  Schulz,  P.  W.  Juodawlkis  and  W.  G.  Lyons 

Massachusetts  Institute  of  Technology,  Lincoln  Laboratory 

244  Wood  St. 

Lexington,  MA  02420-9108 

For  well  over  a  decade,  rf/photonic  techniques  have  been  under  development  for 
realizing  wideband  agile-beam  rf  antenna  systems.  Much  of  this  work  has  centered  on  phased- 
array  systems  which,  compared  to  mechanically  scanned  antennas,  offer  important  performance 
features  including  multiple  independent  beams,  rapid  scanning,  and  wide  instantaneous 
bandwidth.  This  talk  will  describe  recent  advances  in  rf/photonic  architectures  aimed  at  realizing 
wideband  rf  array-antenna  systems.  While  hurdles  remain,  continuing  advances  in  photonic 
device  performance  offer  the  prospect  of  rf/photonic  antenna  systems  with  significant 
performance  advantages  over  all-rf  approaches. 

Photonics  can  positively  impact  the  design  of  rf  array  antenna  systems  in  several  key 
ways.  Large  arrays  require  complex  distribution  networks,  and  optical  fiber  offers  a  clear 
advantage  over  rf  coaxial  cable  or  waveguide  in  terms  of  size,  weight,  and  attenuation.  In 
addition,  true-time-delay  (TTD)  beamforming  is  required  for  squint-free  wide-bandwidth  array 
operation.  A  number  of  photonic  bemforming  techniques  are  under  development  (1-3)  that 
exploit  recent  advances  in  photonic  components  for  digital  fiber-optic  networks,  including  spatial 
switches,  fiber  Bragg  gratings,  tunable  lasers,  multiplexers  and  filters.  Addressing  the 
requirements  of  wideband  analog  signal  processing  and  distribution  remains  an  important 
challenge  in  adapting  digital  fiberoptic  technology  to  rf  antenna  systems. 

A  generic  rf/photonic  receive  antenna  architecture  is  shown  in  Fig.  1.  The  rf  signal  from 
each  element  or  subarray  modulates  an  optical  carrier  wave.  Time  delays  appropriate  for  the 
beampointing  direction  are  applied  to  each  subarray  output,  and  the  combined  signal  is  converted 
back  to  rf  in  a  photoreceiver.  For  TTD  beamforming,  hybrid  approaches  combining  rf  techniques 
for  fine-scale  TTD  and  photonic  methods  for  coarse  TTD  show  particular  promise.  For  high- 
performance  systems,  EO  conversion  is  typically  accomplished  using  a  CW  laser  and  external 
electrooptic  modulator.  With  components  available  today,  analog  modulation  bandwidths  of 
10’s  of  GHz  are  readily  achieved.  The  dc  power  requirements  of  rf-to-photonic  conversion  is  an 
important  system  consideration  for  space  applications  involving  large  arrays.  Depending  on 
system  dynamic  range  needs,  a  single  rf-to-photonic  converter  could  consume  in  excess  of  1  W 
with  current  technology.  Improvements  in  modulator  sensitivity  (reducing  V„)  and  linearity,  and 
in  laser  efficiency,  offer  the  prospect  of  reducing  dc  power  requirements  by  an  order  of 
magnitude. 

Photonic  TTD  architectures  can  be  realized  utilizing  a  combination  of  (1)  spatial 
switching  between  fiber  elements,  and  (2)  wavelength-addressable  delay.  Fiber-optic  lxm 
switches  with  low  insertion  loss  and  low  crosstalk  are  now  readily  available.  A  photonic  TTD 
design  that  combines  spatial  switching  and  wavelength  addressable  delay  is  shown  in  Fig.  2.  The 
design  exploits  recent  advances  in  optical  wavelength  multiplexing  technology.  A  fixed 
wavelength  is  assigned  to  each  input  and  the  signals  are  combined  through  a  wavelength 
multiplexer  on  a  single  fiber.  A  lxm  switch  selects  a  fiber  Bragg  grating  which  applies  the 
appropriate  time  delay  to  each  wavelength  component.  Systems  can  be  configured  for  two- 
dimensional  beam  steering.  Photonic  beamformers  of  this  type  offer  a  large  size  and  weight 
savings  over  all-rf  designs  of  comparable  performance. 

In  summary,  rf/photonic  techniques  which  exploit  the  small  size  and  weight  of  optical 
fiber,  the  wide  bandwidths  of  optoelectronic  components,  and  the  unique  capabilities  offered  by 
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wavelength  multiplexing  offer  very  significant  opportunities  for  realizing  high-performance 
array  antenna  systems. 

*  This  work  was  sponsored  by  the  Department  of  the  Air  Force  under  Air  Force  contract 
F19628-95-C-0002.  Opinions,  interpretations,  conclusions,  and  recommendations  are  those  of 
the  author  and  are  not  necessarily  endorsed  by  the  United  States  Air  Force. 
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Fig.  1.  Schematic  view  of  an 
rf/photonic  phased-array  receive 
antenna.  Subarray  rf  output 
signals  modulate  an  optical 
carrier  wave  (EO  converter)  and 
the  signals  are  sent  via  optical 
fiber  to  a  photonic  true-time- 
delay  (TTD)  beamforming 
network.  The  combined  signal 
is  converted  back  to  rf  to 
provide  the  beam  output. 


10 


WD3.1  (Invited) 

4:30pm-5:00pm 

A  Concept  for  Very  Large  Telescopes  Using  Pupil  Plane  Correction  Optics 
And  Lower-Cost  Spherical  Primary  Mirror  Segments 


Nelson  Wallace 

Electro-Optics,  Lasers,  and  Research  Center 
Space  &  Technology  Division 
TRW  Space  &  Defense,  Redondo  Beach,  CA  90278 


Abstract 

For  very  large,  space-erectable  optical  systems  (NASA  science  missions, 
high  energy  laser  weapons,  etc.),  the  telescope's  primary  mirror  (PM)  would  likely 
be  an  ensemble  of  mirror  segments  deployed  as  concentric  rings  of  reflective 
mirror  segments.  Most  optical  design  forms  for  high-performance  telescopes 
employ  a  conic  PM  shape.  The  use  of  conic  segments  would  require  a  different 
mirror  shape  for  each  ring  of  mirrors,  and  thus  different  fabrication  and  testing 
facilities  for  the  mirrors  in  each  ring,  at  significant  cost  and  schedule.  The 
prospect  of  assembling  the  primary  mirror  from  small,  low-cost  identical  spherical 
mirror  segments  is  very  attractive. 

This  paper  discusses  a  concept  for  a  telescope  optical  design  form  that 
would  use  one  spherical  shape  for  all  of  the  PM  segments,  and  correct  the 
increased  aberration  with  a  segmented  corrector  downstream  at  a  pupil.  The  PM 
segments  would  be  individually  despaced  and  tilted  to  optimize  performance. 
Additionally,  the  aspheric  coefficients  on  the  segmented  corrector  surfaces  would 
be  different  for  each  radial  zone;  this  is  the  key  feature  of  the  approach,  and 
allows  a  better  correction  than  if  a  single  aspheric  shape  were  applied  to  the 
corrector  surface.  Preliminary  design  results  for  a  30  m  Gregorian  telescope 
show  sixth-wave  peak-to-valley  residual  optical  path  differences  (OPD's)  over  a 
0.6  arc-minute  full  field  of  view.  The  results  show  that  the  concept  works  well  for 
small  FOV  systems,  which  may  be  sufficient  given  constraints  on  downstream 
instrument  package  sizes  as  a  result  of  the  optical  invariant. 

This  high-risk,  potentially  high-payoff  concept  separates  one  very  difficult 
problem  into  two  (hopefully)  less  difficult  ones.  There  remains  the  issue  that,  if 
the  scheme  were  technically  possible,  the  decrease  in  PM  fabrication  &  testing 
cost  from  using  spherical  segments  would  have  to  more  than  offset  the  cost 
increase  from  the  pupil  corrector. 
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Liquid  Crystal  Devices  as  Holographic  Recording  Media 
for  Aberration  Compensation  in  Membrane-Mirror 
Based  Telescope  Systems 

Ty  Martinez  and  Mark  Gruneisen, 

United  States  Air  Force  Research  Laboratory 
Directed  Energy  Directorate,  Kirtland  AFB,  New  Mexico  87117 
phone:  (505)  846-9296,  fax:  (505)  853-6236,  email:  martinty@plk.af.mil 

Kenneth  MacDonald  and  Jeffery  Baker 
Boeing  North  American,  PO  Box  5670 
Albuquerque  NM  87185 

phone:  (505)  853-4362,  email:  kenneth.macdonald@plk.af.mil 

ABSTRACT 

Membrane  mirrors  are  a  lightweight  alternative  to  conventional  telescope  primary  mirrors 
large  aberrations  that  exceed  the  correction  capabilities  of  conventional  adaptive  optics 
recording  media  can  compensate  membrane  mirror  aberrations. 

INTRODUCTION 

Telescopes  are  integral  components  of  light  gathering  and  beam  projecting  optical  systems  with  diverse  military 
and  civilian  applications.  Orbiting  space-based  telescope  systems  offer  many  advantages  including  reduced  effects 
from  atmospheric  turbulence.  In  order  to  achieve  high-resolution  optical  performance  from  orbital  distances 
however,  the  telescope  apertures  must  be  scaled  to  large  diameters.  For  some  applications,  aperture  sizes  of  tens  of 
meters  may  be  required.  At  these  diameters,  the  size  and  weight  of  conventional  primary  mirrors  are  beyond  the 
capacity  of  existing  launch  vehicles. 

These  limitations  may  be  overcome  with  lightweight  compact  primary  mirror  architectures  such  as  deployable 
membrane  mirrors  currently  under  development  at  the  Air  Force  Research  Laboratory  (AFRL).  In  recent  years, 
there  have  been  significant  improvements  in  the  engineering  of  membrane  optics.  1  Nonetheless,  it  is  believed  that 
such  ultra  lightweight,  large  aperture  mirrors  will  have  significant  aberrations  compared  to  conventional  mirrors. 

The  large  aberrations  associated  with  the  ultra  lightweight  primary  mirrors  create  the  need  for  an  optical 
wavefront  compensation  technique  that  can  remove  100’s  of  microns  of  optical  path  difference  (OPD).  This 
exceeds  the  capabilities  of  conventional  deformable-mirror  adaptive  optical  systems,  such  as  those  developed  and 
demonstrated  by  the  AFRL  Starfire  Optical  Range.  2  These  deformable-mirror  devices  rely  on  actuators  to  deform  a 
mirror  surface  and  introduce  an  equal-but-opposite  aberration  to  the  aberrated  wavefront.  The  achievable  resolution 
and  magnitude  of  displacement  limit  correctable  OPD  to  only  a  few  microns  and  consequently  this  technology  is  not 
adequate  for  the  membrane  mirror  systems. 

In  recent  years,  compensation  of  severe  dynamic  aberrations  in  a  membrane-mirror  based  telescope  using  real¬ 
time  holographic  phase  subtraction  has  been  demonstrated.  3  This  technique  used  a  liquid  crystal  optically  addressed 
spatial  light  modulator  (OASLM)  to  compensates  for  OPD  errors  on  the  order  of  1 00  pm  and  was  applied  at  rates  up 
to  1  kHz,  which  is  more  than  adequate  to  compensate  time-dependent  aberrations  associated  with  the  dynamic 
motion  of  the  thin  membrane. 4  Such  a  capability  may  dramatically  reduce  the  primary  mirror  surface  tolerances 
making  feasible  the  implementation  of  ultra-lightweight  deployable  primary  mirrors. 

THEORETICAL  LIMITS  FOR  THIN  DIFFRACTION  MEDIA 

Historically,  the  principle  shortcoming  of  the  liquid  crystal  OASLM  has  been  limited  diffraction  efficiency.  As 
thin  diffraction  media,  the  maximum  theoretical  first-order  diffraction  efficiencies  are  6%  and  1 0%  for  analog  and 
binary  amplitude  modulation  and  34%  and  40%  for  analog  and  binary  phase  modulation  respectively. 
Commercially,  OASLM  devices  have  been  primarily  developed  as  amplitude  modulators  for  incoherent-to-coherent 
converters  and  image  projection  applications.  Consequently,  only  devices  with  diffraction  efficiencies  of  a  few 
percent  were  widely  available.  A  commercially  available  nematic-based  OASLM ’s  can  be  operated  as  analog  phase 
modulators  with  30%  diffraction  efficiency.  But  the  diffraction  occurs  for  only  one  polarization  and  with  relatively 
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slow  relaxation  time  compared  to  ferroelectric  based  devices.  5  More  recently,  computer-generated  blazed  phase 
holograms  have  been  demonstrated  with  90%  diffraction  efficiency.  6 

BLAZED  HOLOGRAPHIC  CORRECTION  OF  ABERRATIONS  IN  A  MEMBRANE-MIRROR  TELESCOPE 

A  schematic  of  the  compensated  membrane  mirror  telescope  experimental  setup  is  shown  in  Figure  1 .  An  object 
point  source  at  infinity  is  simulated  by  illuminating  a  20-  pm  pinhole  with  the  532  nm  output  of  a  frequency  doubled 
Nd:YAG  laser  and  collimating  the  transmitted  light  with  a  conventional  f/6  parabolic  mirror.  The  clear  aperture  of 
the  collimated  wavefront  is  restricted  to  50  mm  with  a  circular  aperture.  The  object  wavefront  is  then  propagated  to 
the  low-optical-quality  membrane  mirror  operating  as  a  50-mm  diameter  f/18  off-axis  parabola.  The  membrane 
mirror  is  fabricated  from  125-  pm  thick,  aluminum-coated,  upilex  plastic  mounted  to  a  vacuum  flange  and  held  under 
partial  vacuum. 

The  mirror  surface  is  probed  and  an  interferogram  of  the  aberration  is  recorded  on  a  video  camera  as  shown  in 
Fig.  1 .  The  interferogram  is  then  digitized  and  processed  and  the  resulting  blazed  hologram  used  for  the  correction 
is  applied  on  the  EASLM,  which  is  projected  on  the  OASLM.  The  diffracted  corrected  order  is  then  relayed  to  a 
camera  located  in  an  image  plane  for  the  object  pinhole. 


Figure  1.  Experimental  setup  for  demonstrating  compensated  membrane-mirror  telescope  with  real-time  blazed  hologram. 

CONCLUSIONS 

Real-time  holographic  phase  subtraction  is  demonstrated  to  be  capable  of  removing  significant  aberrations  from 
membrane-based  primary  mirrors.  Historically,  low  diffraction  efficiencies  of  real-time  hologram  recording  media 
limited  the  performance  of  such  compensated  systems.  Recent  developments  in  liquid-crystal  spatial  light 
modulators  have  increased  the  diffraction  efficiencies  to  approach  the  theoretical  limits  for  simple  thin  phase 
holograms  and  blazed  phase  holograms. 
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Lasers  and  Electro-Optical  Systems,  Boeing  Space  &  Communications  Group 
6633  Canoga  Ave.  MC  FA-04,  P.O.  Box  7922,  Canoga  Park,  CA  91309-7922 

1.  INTRODUCTION 

Large  deployable  space-based  optical  systems  will  likely  require  complex  structure  position  controls  in  conjunction  with  an 
adaptive  optic  to  maintain  optical  tolerances  necessary  for  near  diffraction-limited  performance.  A  real-time  holographic 
(RTH)  compensation  system  can  greatly  reduce  the  requirements  and  complexity  of  the  position  control  system  and  enable 
the  use  of  novel  or  imperfect  optical  components  for  large  mirror  surfaces.  Activities1'2  in  compensated  imaging  techniques 
have  focused  on  methods  such  as  holography  to  allow  imaging  with  low  quality  optics. 

The  holographic  method  for  correcting  aberrations  was  first  proposed  by  Gabor  in  19493.  In  the  60’s  Kogelnik  and 
Pennington4  and  Upatnieks  et  al.5  used  holographic  plates  (static  correction)  to  remove  random  aberrations  and  spherical 
aberration,  respectively,  for  telescope  systems.  More  recently,  Munch  et  al6'*  have  been  using  holographic  plates  to  correct 
aberrated  primaries  in  reflecting  and  refractive  telescopes  with  broadband  images6'7  and  most  recently  have  demonstrated  a 
diffraction  limited  system  with  a  severely  aberrated  0.5  m  primary8. 

The  spatial  light  modulator  has  allowed  the  realization  of  dynamic  correction  of  aberrated  primary  mirrors.  Recent  activity 
has  focused  on  using  liquid  crystal  optically-addressed  spatial-light-modulators  (LC  OASLMs)  as  the  holographic  medium  of 
a  dynamic  correction  element  in  holographic  correction  systems  Both  Vasil’ev  et  al?  and  Gruneisen  et  al?  have  used 
OASLMs  and  RTH  to  correct  for  primary  mirror  distrtions.  The  former  demonstrated  correction  of  a  poor  quality  segmented 
primary  mirror  while  the  latter  demonstrated  correction  of  a  membrane  primary  mirror. 

Our  group  has  been  pursuing  real-time  holographic  (RTH)  image  correction  since  1990  to  compensate  for  atmospheric 
turbulence9  as  well  as  turbulent  boundary  layer  optical  distortions10.  The  latter  experiments  concluded  with  successful 
dynamic  image  correction  for  field  tests  involving  horizontal  imaging  at  a  range  of  one  mile.  In  this  paper  we  report  on  a 
dynamic  correction  system  capable  of  5  kHz  operation  using  real-time  holography  to  correct  broadband  (532  ±  40  nm) 
illuminated  targets  collected  by  an  aberrated  0.75  m  primary  mirror.  Scaling  this  technology  to  larger  primary  mirror 
diameters,  higher  optical  efficiencies,  and  increased  power  handling  capacity  requires  system  level  trades  of  temporal, 
spectral  and  spatial  bandwidths,  field  of  view,  magnification  and  LC  OASLM  device  development. 

2.  EXPERIMENTAL 

A  schematic  diagram  of  the  large  optics  compensator  testbed  is  shown  in  Figure  1.  The  setup  consists  of  a  scene  generator,  a 
telescope  receiver  primary  mirror,  the  telescope  secondary  with  imaging  optics,  and  a  holographic  compensator  beacon  leg. 
Both  the  scene  generator  and  telescope  receiver  use  identical  0.75  m  f/4  parabolic  aluminum  surface  Pyrex  mirrors  with 
measured  figures  around  0.5  X  peak  to  valley  (P-V). 


3.  TEST  AND  ANALYSIS 

A  series  of  tests  was  carried  out  to  demonstrate  broad  spectral  band  compensation  and  explore  the  correctable  phase/spectral 
bandwidth  trade  space.  Initial  setup  and  alignment  tests  used  the  532  nm  laser  source  for  target  illumination.  These  tests 
were  followed  by  a  complete  series  of  monochromatic  and  polychromatic  compensation  tests  with  fixed  mirror  warps  up  to 
22  X.  Figure  2  shows  image  correction  of  the  Air  Force  Test  Pattern  for  a  22X  misfigured  primary.  Dramatic  correction  is 
seen  for  the  severely  distorted  primary.  The  expected  degradation  of  image  compensation  with  increasing  spectral  bandwidth 
is  observed.  Similar  qualitative  tests  were  carried  out  at  mirror  warps  of  3,  7.5,  and  12  X. 


Spectral  Bandwidth  (FWHM,  nm) 

35  50  80  200 


Distorted  Images 


RTH  Compensated  Images 

Figure  2.  Distorted  and  corrected  broad  spectral  band  images  with  22  A  of  mirror  warp. 

4.  CONCLUSION 

We  have  demonstrated  compensation  of  a  severely  warped  meter-class  telescope  and  explored  the  performance  trade  space. 
RTH  compensation  can  permit  the  use  of  large  imperfect  mirrors  of  misphased  mirror  segments  in  aggressively  lighweighted 
deployable  optics.  The  technique  can  also  compensate  for  dynamic  aberrations  introduced  by  vibrations  and  thermal  loading. 
This  can  greatly  reduce  the  weight  and  complexity  of  isolation  and  positioning  systems  for  large  deployable  telescope 
systems.  The  optical  efficiency  of  an  OASLM-based  holographic  compensation  system  can  approach  40%  while  providing 
the  equivalent  spatial  resolution  of  deformable  mirrors  containing  more  than  10,000  actuators.  We  have  also  demonstrated 
increased  optical  efficiency  by  the  generation  of  blazed  gratings.  Increased  power  handling  and  compensation  of  larger  area 
primary  mirrors  requires  development  of  larger  area  OASLMs. 
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Noise  and  Distortion  Characteristics  of  Cryogenic 
Vertical  Cavity  Lasers 
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Department  of  Electrical  and  Computer  Engineering 
University  of  California,  Santa  Barbara,  CA  93106 


High  speed  free-space  optical  interconnections,  from  focal  plane  arrays  in  a  cryogenic  environment  to 
room  temperature  electronics,  require  the  optical  transmitter  with  favorable  analog  modulation  properties 
over  wide  temperature  range.  Due  to  the  small  size,  low-power  consumption,  and  the  possibility  of 
integration,  vertical  cavity  laser  (VCL)  is  one  of  the  best  candidates  for  such  applications  [1,2].  In  previous 
work,  it  has  been  demonstrated  that  the  low-temperature  optimized  VCLs  had  higher  output  powers,  higher 
quantum  efficiencies,  and  larger  modulation  bandwidths  at  77K  [3].  In  this  paper,  both  the  noise  and 
distortion  characteristics  of  the  said  VCLs  are  further  investigated  under  both  room  temperature  and 
cryogenic  environment. 

The  measured  lasers  were  the  double  intracavity,  oxide-aperture,  5.4pm  diameter  VCLs  with  an 
operating  wavelength  of  940nm.  The  structure  and  the  processing  of  the  low-temperature  optimized  devices 
have  been  described  previously  in  [2,3],  The  1x6  laser  array  collimated  by  a  microlens  was  assembled  into 
a  cryostat  to  realize  the  cryogenic  operation.  The  optical  beams  transmitted  out  from  the  cryostat  were 
further  converged  by  another  lens  and  received  by  a  silicon  avalanche  photodetector  (APD).  The 
measurement  equipment  was  the  HP  8566B  electrical  spectrum  analyzer  with  10Hz  resolution. 

Figure  1  shows  the  L-I-V  curves  of  the  VCL,  which  were  measured  outside  the  cryostat  at  120  and 
297K.  The  threshold  of  0.15mA  and  differential  efficiency  of  0.5mW/mA  were  measured  at  120K,  while 
the  values  at  297K  were  respectively  0.4mA  and  0.37mW/mA.  The  differential  efficiency  is  much  higher  at 
lower  temperature  is  because  that,  due  to  joule  heating,  the  gain  peak  and  the  resonance  mode  are 
improving  their  mutual  alignment  as  the  device  is  biased. 

Relative  noise  intensity  (RIN)  is  the  most  important  parameter  to  embody  the  noise  properties  of 
lasers.  In  order  to  measure  it,  a  40dB  electrical  amplifier  was  placed  behind  the  APD  to  elevate  the  source 
noise  above  the  noise  floor  of  the  whole  link.  Figure  2  shows  the  relative  noise  measured  at  297K  against 
various  bias  levels.  At  bias  level  of  1 .5mA  corresponding  to  the  0.5mW  output  power,  the  relative  noise 
level  is  lower  than  -130dB/Hz,  which  is  much  lower  than  the  practical  requirement  of  the  high-speed  data 
communications. 

Even  though,  it’s  found  that  the  noise  still  dominatingly  attributes  to  the  shot  noise  of  the  APD,  rather 
than  the  source  RIN.  To  verify  this,  an  optical  attenuator  was  introduced  to  change  the  optical  power 
detected  by  APD.  The  photodiode  shot  noise  is  directly  proportional  to  the  received  optical  power,  while 
the  source  RIN  is  proportional  to  the  square  of  the  optical  power.  The  relationship  between  noise  level  and 
the  attenuation,  which  is  shown  as  figure  3,  confirms  that  the  measurement  link  is  limited  by  the  shot  noise 
at  high  bias  current.  Only  under  the  bias  level  just  above  the  threshold  does  the  logarithmic  slope  approach 
two,  which  is  because  that  the  RIN  of  the  laser  is  inversely  proportional  to  the  cubic  of  the  relative  bias 
level. 
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Current  (mA) 

Fig.l  L-I-V  characteristics  for  940nm  VCL  under 
both  room  and  cryogenic  temperatures. 


Fig-2  Relative  noise  of  the  measuring  link  Inset  is 
the  relative  noise  versus  bias  current  at  500MHz. 
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Optical  Attenuation  (dB)  Frequency  (GHz) 

Fig-3  Detected  relative  noise  values  while  adjusting  Fig.4  SFDR  value  of  VCL  versus  modulation 
the  optical  attenuator  between  VCL  and  APD.  frequency  with  different  bias  currents. 

Further  comparing  the  relative  noise  levels  at  different  biases  shown  as  the  inset  of  the  fig.2,  there  is  an 
evident  rise  around  3mA  bias.  The  reason  is  that  the  linear  polarization  mode  below  2mA  won’t  be  stable 
any  more  at  higher  bias  current.  The  random  mode  hopping  produces  the  additional  polarization  mode 
noise  on  the  whole  transmission  link  even  without  any  polarization-sensitive  component. 

The  spur  free  dynamic  range  (SFDR)  of  the  VCL  was  measured  with  the  conventional  two-tone 
modulation  [4].  The  frequency  spacing  between  the  two  equal-magnitude  modulation  signals  was  1MHz. 
The  SFDR  value  is  defined  as  the  input  signal  power  ratio  of  the  point  where  the  fundamental  crosses  the 
noise  floor  to  the  point  where  the  third-order  distortion  signal  crosses  the  noise  floor.  Due  to  the  above- 
mentioned  limitation  of  the  shot  noise,  the  upper  limit  of  the  noise  floor  is  assumed  to  be  -150dBm/Hz 
below.  Therefore,  the  practical  dynamic  range  will  be  even  higher  than  the  calculated  SFDR  values  here. 

Figure  4  shows  the  dynamic  range  of  the  VCL  against  frequency  and  bias  current  under  both  room  and 
cryogenic  temperatures.  Comparing  the  curves  under  different  bias  levels,  it’s  evident  that  the  optimal  bias 
state  corresponding  to  the  highest  dynamic  range  is  around  1 .5mA.  The  highest  SFDR  value,  96dB  Hz273,  is 
much  higher  than  the  requirement  for  the  personal  communication  systems  which  is  in  the  range  of  72- 
83dB  Hz273.  Furthermore,  the  dynamic  range  at  120K  is  about  2dB  higher  than  the  case  under  the  room 
temperature.  It  means  that  the  low-temperature  optimized  VCL  does  have  the  better  linearity  under  the 
cryogenic  environment. 

In  conclusion,  both  the  noise  and  distortion  properties  of  the  low  power  VCL  have  been  studies  under 
both  room  and  cryogenic  temperatures.  The  noise  of  the  measurement  link  with  high  bias  level  is 
dominated  by  the  photodiode  shot  noise.  The  upper  limit  on  the  RIN  of  the  VCL  with  0.5mW  output  power 
is  -130dB/Hz.  At  high  bias  levels,  the  polarization  mode  hopping  will  induce  additional  noise,  even 
without  any  strongly  polarization-selective  components  in  the  link.  The  SFDR  level  of  90-100dB  Hz273  is 
measured.  The  best  linearity  is  produced  under  1.5mA  bias  current.  Especially  the  higher  dynamic  range 
can  be  realized  under  lower  temperature,  which  authenticates  the  validity  of  the  low-temperature  optimized 
VCLs. 
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Detector  Array  Considerations  for  Laser  Remote  Sensing 
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Department  of  Physics 
University  of  South  Florida 
Tampa,  FL  33620 


We  have  developed  a  1.5  micron  multi-detector  heterodyne  Doppler  lidar  type  system  in 
the  laboratory  to  study  the  enhancement  in  expected  S/N  through  use  of  a  multi-element 
and  balanced  heterodyne  detector  array.  Preliminary  experiments  have  been  conducted 
and  indicate  that  the  coherent  summation  of  two  separate  balanced  heterodyne  detector 
signals  seems  to  provide  enhanced  detection  of  the  Doppler  shifted  signal.  Initial  studies 
regarding  the  expected  Doppler  shift,  geometrical  vibration  effects,  and  other  system 
noise  sources  are  being  studied.  We  are  in  the  process  of  comparing  these  new  results 
with  those  obtained  previously  by  us  at  1  micron  using  an  atmospheric  Heterodyne  multi¬ 
detector  Lidar  system. 


Our  1.5  micron  Doppler  lidar  (laboratory)  system  is  as  follows:  A  30  mW  1 .5  micron, 
single-frequency  diode  laser  is  used  as  the  main  excitation  source.  The  beam  is  split  into 
a  L.O.  channel  and  a  main  projected  beam  followed  by  a  27MHz  shifted  Acousto-Optic 
frequency  shifter.  The  main  portion  is  sent  to  a  rotating  target,  and  the  reflected 
backscatter  collected  by  2  spatially  separated  (V  groove  blocks)  fiber-optic  ,  single 
frequency,  polarization  preserving  fibers.  At  present,  the  two  collection  fibers  are 
separated  by  100  microns;  as  such,  this  is  the  effective  separation  of  the  two  coherent 
detectors.  The  two  collected  signals  are  branched  into  a  I  &  Q  channels  for  balanced 
heterodyne  detection.  The  fours  channels  are  detected  and  the  electrical  signals  analyzed 
by  spectrum  analyzers,  4  channel  oscilloscopes,  and  computer  interfaced. 


We  are  currently  studying  the  change  in  the  Doppler  center  frequency,  spread  of  the 
Doppler  shifted  signal,  influence  on  the  geometrical  and  velocity  aspects  of  the  target, 
•and  linewidth  stability  of  the  diode  laser. 
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Developments  in  Remote  Sensing  Instrument  Optical  Designs 

Michael  P.  Chrisp 

Ball  Aerospace  and  Technologies  Corporation, 

1600  Commerce  St.,  Boulder  CO  80301 

1.  Introduction 

Remote  sensing  from  space  utilizing  imaging  spectrometry  is  placing  greater  demands  on  the 
design  of  these  instruments.  The  requirements  for  quality  and  purity  of  the  spectral  data 
increases  with  each  generation  of  imaging  spectrometers  and  drives  the  development  of 
improved  designs.  These  designs  provide  control  of  the  spectral  stray  light,  the  non-linearity 
of  the  spectral  dispersion,  the  spectrometer  size,  and  spectral  transmittance. 

This  presentation  covers  the  application  of  imaging  spectrometer  designs  based  on  convex 
gratings,  and  designs  utilizing  apochromatic  lens  systems.  Convex  grating  designs  are 
compact  with  good  spectral  stray  light  control.  The  apochromatic  lens  designs  can  provide 
good  performance  over  large  wavelength  ranges. 

2.  Imaging  Spectrometers  utilizing  Convex  Grating  Designs 

Imaging  spectrometers  based  on  the  Offner  type  convex  grating  design  provide  extremely 
good  spectral  and  spatial  resolution.  They  consist  of  two  concave  mirrors  and  a  convex 
grating  arranged  in  a  similar  arrangement  to  the  Offner  reflective  relay.  The  power  of  these 
optical  elements  is  balanced  to  ensure  a  flat  field.  At  reasonable  f-numbers,  spherical  surfaces 
can  be  used,  for  lower  f-number  designs  aspheric  surfaces  help  control  the  higher  order 
aberrations.  These  spectrometer  designs  have  very  low  spectral  slit  curvature  and  low  spatial 
distortion  over  the  spectral  range.  The  grating  provides  good  linearity  of  the  spectral 
dispersion  across  the  focal  plane.  Because  these  designs  are  all-reflective  system,  they  are  less 
sensitive  to  the  thermal  and  radiative  environments  compared  to  refractive  designs. 

Convex  grating  spectrometers  have 
extremely  low  stray  light  for  a  single 
stage  spectrometer  system.  There  are  9.5  cm 

only  three  reflecting  optical  elements 
that  can  scatter,  and  a  high  quality 
holographic  grating  in  the 
spectrometer  provides  good  spectral 
stray  light  control.  The  principal 
difficulty  with  these  designs  is  the  low 
diffraction  efficiency  of  holographic 
gratings  (which  have  sinusoidal 
profiles).  For  greater  diffraction 
efficiency,  electron  beam  written  gratings 
are  necessary,  and  these  are  currently 
produced  by  the  Jet  Propulsion  Laboratory. 


Figure  1  Convex  Grating  Pushbroom  Imaging 
Spectrometer  (M.  Chrisp  Patent  No.  5,880,834) 
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Shown  in  figure  1  is  an  imaging  spectrometer  based  on  a  convex  grating  design.  This  design 
has  a  three  mirror  anastigmatic  telescope  followed  by  a  convex  grating  imaging  spectrometer. 
The  exit  pupil  position  from  the  telescope,  although  not  telecentric,  is  a  good  match  for  the 
convex  grating  spectrometer.  This  pushbroom  imaging  spectrometer  is  designed  for  space 
remote  sensing  and  has  good  spatial  and  spectral  performance. 

3.  Imaging  Spectrometers  utilizing  Apochromatic  Lens  Systems 

Imaging  spectrometers  with  apochromatic  camera  and  collimator  lenses  provide  good  spectral 
and  spatial  performance  over  extended  wavelength  ranges.  They  are  useful  for  the  visible  and 
infrared  regions,  but  are  not  applicable  to  the  ultraviolet  region.  The  multiple  lens  surfaces 
create  higher  spectral  stray  light  than  reflective  spectrometer  systems,  primarily  due  to  the 
multiple  ghost  images. 

Shown  in  Figure  2  is  a  design  of  a  prism  spectrometer  covering  a  large  wavelength  range  over 
the  visible  and  near  infrared.  The  apochromatic  lenses  enable  this  wavelength  range  to  remain 
in  focus  over  the  detector  array.  The  prism  has  the  advantage,  over  a  grating  system,  of  high 
transmittance  over  this  large  wavelength 
range. 


Prism  designs,  like  this,  are  limited  in 
their  spectral  resolution,  and  also  the 
non-linearity  of  their  spectral  dispersion. 

It  is  possible  to  use  multiple  prisms  to 
linearize  the  spectral  dispersion,  but  this 
decreases  the  spectral  resolution  further. 

However,  prism  systems  avoid  the  Figure  2  Imaging  Spectrometer  utilizing 

grating  systems’  problem  of  order  overlap  for  Apochromatic  Lenses 

multiple-octave  wavelength  ranges. 

For  higher  spectral  dispersion,  grating  systems  with  apochromatic  lenses  provide  good 
spectral  performance.  They  can  use  readily  available  high  diffraction  efficiency  plane  gratings 
(unlike  the  convex  grating  designs);  however,  correcting  the  spectral  line  curvature  in  these 
designs  is  very  difficult. 

4.  Conclusion 

The  choice  of  an  imaging  spectrometer  design  for  a  given  application  depends  upon  the 
dominant  requirements  for  spectral  retrieval  algorithms.  These  algorithms  set  the  tolerances 
on  the  linearity  of  the  spectral  dispersion,  and  the  signal-to-noise  ratios  required.  They  also 
determine  if  the  detector  array  can  be  resampled  for  spectral  correction.  The  latest  convex 
grating  spectrometer  designs  provide  good  spectral  performance  in  a  compact  instrument,  and 
the  latest  apochromatic  lens  spectrometer  designs  provide  high  transmittance  over  large 
wavelength  ranges. 
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Design  and  Operation  of  the  Hopkins  Ultraviolet  Telescope 

By 

Samuel  T.  Durrance 

Florida  Space  Institute,  University  of  Central  Florida 

The  Hopkins  Ultraviolet  Telescope  (HUT)  was  flown  aboard  the  Space  Shuttle 
Columbia  for  9  days  in  December  1990  and  the  Space  Shuttle  Endeavour  for  17  days  in 
March  1995.  Specrtophotometric  observations  of  numerous  astronomical  sources  were 
made  throughout  the  far  ultraviolet  (912-1850  A)  at  a  resolution  of  -3  A,  and  for  some 
sources,  in  the  extreme  ultraviolet  (415  -  912  A)  at  a  resolution  of  ~1.5  A.  In  this  paper 
we  describe  the  HUT  instrument  and  its  performance  in  orbit. 

The  optical  system  consists  of  a  90  cm  diameter  / /2  primary  mirror  and  a  prime- 
focus,  Rowland  circle  spectrograph.  These  elements  are  held  in  position  by  a  metering 
structure  of  low-expansion  Invar.  The  entire  assembly  is  housed  within  an  aluminum 
environmental  control  canister,  which  protects  against  contamination  on  the  ground  and 
provides  a  thermally-controlled,  optically-shielded  environment  during  observations.  The 
parabolic  primary  mirror  is  fabricated  of  Zerodur  and  figured  to  provide  1  image 
quality  at  the  focal  plane.  It  was  coated  with  iridium  for  Astro- 1,  which  provides  -20% 
reflectivity  and  is  stable  in  the  low-Earth  orbital  environment.  It  was  coated  with  SiC  for 
Astro-2,  which  provides  -  40%  reflectivity  but  may  degrade  in  the  low-Earth  orbital 
environment.  Three  motorized  mechanisms  in  the  Invar  mirror-mounting  structure 
provide  the  capability  to  focus  and  align  the  instrument  during  the  mission. 

The  prime  focus  spectrograph  is  housed  in  an  evacuated  stainless  steel  enclosure, 
whose  vacuum  is  maintained  at  -10'6  torr  by  redundant  Vac-Ion  pumps.  The 
spectrograph  contains  a  20  cm  diameter / /2  spherical  concave  grating,  holographically 
generated  (600  lines  mm'1)  and  overcoated  with  osmium.  The  depth  of  the  sinusoidal 
groove  profile  is  chosen  to  provide  optimal  first-order  efficiency  at  1000  A.  Light  passing 
through  the  spectrograph  entrance  aperture  is  diffracted  and  focused  onto  a  photon¬ 
counting  Csl  coated  microchannel-plate  intensifier  at  a  dispersion  of  41  A  mm"1.  The 
intensifier  employs  a  chevron  pair  microchannel  plates  (MCPs)  with  10  m  di  a  meter 
pores  and  25  mm  active  area,  operated  at  a  gain  of  2  x  106.  Charge  pulses  from  the  MCPs 
impinge  on  a  P20  phosphor,  whose  visible  light  output  is  coupled  via  fiber  optics  to  a 
one-dimensional  1024  element  self-scanned  photodiode  array  (Reticon)  clocked  at  1  scan 
ms'1.  A  typical  charge  pulse  covers  -  8  photodiodes  (200  m)  at  threshold.  Each  pulse  is 
centroided  by  an  onboard  computer,  yielding  a  pixel  size  of  12.5  m  ( half  the  diode 
width  or  0.51  A  in  first  order),  and  is  accumulated  in  a  2048  element  spectral  array.  The 
detector  resolution  is  -1  A,  but  the  aberrations  of  the / /2  optical  system  limit  the 
effective  resolution  for  a  point  source  to  about  3  A  in  first  order.  The  plate  scale  at  the 
detector  is  2.8  A'1,  so  when  the  pointing  is  stable  (±1  )  the  resolution  is  not 
significantly  degraded  by  image  motion. 

An  eight-position  rotating  mechanism  at  the  spectrograph  entrance  provides  a 
choice  among  six  aperture  sizes  and/or  filters  for  observations,  as  well  as  two  positions 
for  ground  operations  (a  vacuum  seal  position  and  a  large  aperture  for  calibration).  Point 
source  observations  are  generally  made  through  one  of  two  circular  apertures  of  1 8  and 
30  diameters.  Extended  source  observations  are  generally  made  through  one  of  two 
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long-slit  apertures  with  dimensions  of  9.4  x  1 16  and  17  x  1 16  ,  providing 
resolution  of  3.3  and  6  A,  respectively. 

The  apertures  were  etched  into  a  mirrored  surface  that  reflects  visible  light  from 
the  surrounding  star  field  through  a  re-imaging  lens  system  and  filter  wheel  onto  the  focal 
plane  of  a  silicon  intensified  target  (SIT)  vidicon  camera.  The  camera  views  a  9  x  12 
field  surrounding  the  spectrograph  aperture.  Digitized  video  images  are  analyzed  by  the 
onboard  processor  and  compared  with  pre-defined  target  and  guide  star  positions.  Final 
target  identification  is  made  onboard  by  an  astronaut,  who  completes  the  acquisition  by 
adjusting  the  Instrument  Pointing  System.  Adjustments  to  the  camera  gain  and  integration 
time,  as  well  as  selection  of  an  appropriate  neutral  density  filter  wheel  position,  permit 
acquisition  of  targets  with  visual  magnitudes  ranging  from  -4  to  +17. 

HUT  incorporates  two  custom-built  microprocessors.  The  spectrometer  processor 
(SP)  handles  the  high-rate  data  from  the  Reticon,  applying  commandable  pulsewidth  and 
amplitude  limits  to  the  digitized  Reticon  output  for  noise  rejection,  calculating  centroids, 
and  creating  telemetry  messages  in  several  possible  formats.  A  high-time-resolution 
mode  provides  a  time-tagged  list  of  each  individual  photon  event,  with  an  accuracy  of  ±1 
ms.  This  mode  can  be  employed  for  sources  whose  total  count  rates  do  not  exceed  ~  600 
counts  s'1.  For  higher  rate  sources,  histogram  mode  is  employed,  in  which  a  cumulative 
2048  element  histogram  is  output  to  the  telemetry  every  2  s.  This  cumulative  histogram  is 
also  downlinked  every  60  seconds  in  high-time-resolution  mode.  Thus  when  data 
dropouts  occur,  only  the  photon  arrival  time  information  is  lost,  and  the  spectral  data  are 
preserved.  The  dedicated  experiment  processor  (DEP)  performs  all  the  functions  of 
command  and  control  of  the  HUT  instrument,  and  also  digitizes,  stores  and  analyzes  the 
video  images  from  the  target  acquisition  camera. 

The  HUT  detector  is  limited  by  phosphor  persistence,  pulse  size  and  scan  rate  to 
total  count  rates  5  000  counts  s'1 .  To  observe  targets  with  higher  count  rates  the  HUT 
telescope  aperture  may  be  reduced  from  the  nominal  full  aperture  size  of  5120  cm2.  A 
50%  reduction  is  accomplished  by  closing  one  of  two  semi-circular  shutter  doors.  Further 
reduction  to  1%  (50  cm2)  and  0.02%  (1  cm2)  of  full  aperture  can  be  made  by  closing  both 
shutter  doors  and  opening  either  of  two  small  apertures  provided  for  this  purpose. 

The  stability  of  the  instrument  throughput  was  monitored  for  Astro-2  by  making 
multiple  observations  of  the  DA  white  dwarf  GD  394  (at  64,  92,  138,  211,  and  341  hours 
MET),  in  addition  to  the  three  observations  of  HZ  43.  The  sensitivity  was  found  to 
decline  gradually  over  the  course  of  the  mission  in  a  wavelength-dependent  fashion.  The 
total  decline  was  relatively  small  at  long  wavelengths  (5%  at  1800  A),  and  rose  steadily 
towards  shorter  wavelengths  (27%  at  912  A).  The  rate  of  decline  was  greatest  early  in  the 
mission,  with  roughly  half  of  the  total  decline  occurring  prior  to  100  hours  MET,  and 
75%  of  the  total  decline  occurring  by  138  hours  MET. 

Neither  the  wavelength  dependence  nor  the  time  dependence  of  this  degradation 
matches  that  of  the  detector  exposure:  the  total  flux  per  unit  area  on  the  detector  has  a 
broad  maximum  over  1 100-1350  A  and  drops  rapidly  towards  either  end  of  the  detector, 
and  it  increases  with  time  during  the  mission.  Both,  however,  are  consistent  with  possible 
degradation  of  the  primary  mirror.  Two  possible  mechanisms  for  this  are  condensation  of 
contaminants  onto  the  mirror,  and  oxidation  of  the  SiC  coating  by  energetic  atomic 
oxygen.  One  of  the  witness  mirrors  removed  from  the  telescope  shortly  after  the  flight 
shows  similar  degradation,  and  is  being  investigated  further. 
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ThD4.3  (Invited) 
2:30pm-3:00pm 


The  Goddard  Space  Flight  Center  Laser  and  Electro-Optics  Branch 

By 

William  S.  Heaps 


The  Laser  and  Electro-Optics  Branch  at  Goddard  Space  flight  Center  was  established  about  three 
years  ago  to  provide  a  focused  center  of  engineering  support  and  technology  development  in 
these  disciplines  with  an  emphasis  on  development  of  spaced  based  instruments.  The  Branch  has 
approximately  15  engineers  and  technicians  with  backgrounds  in  physics,  optics,  and  electrical 
engineering.  Members  of  the  Branch  are  currently  supporting  a  number  of  space  based  lidar 
efforts  as  well  as  several  technology  efforts  aimed  at  enabling  future  missions. 

The  largest  effort  within  the  Branch  is  support  of  the  Ice,  Cloud,  and  land  Elevation  Satellite 
(ICESAT)  carrying  the  Geoscience  Laser  Altimeter  System  (GLAS)  instrument.  The 
ICESAT/GLAS  primary  science  objectives  are: 

To  determine  the  mass  balance  of  the  polar  ice  sheets  and  their 
contributions  to  global  sea  level  change 
To  obtain  essential  data  for  prediction  of  future  changes  in  ice 
volume  and  sea-level. 

The  secondary  science  objectives  are: 

To  measure  cloud  heights  and  the  vertical  structure  of  clouds 

and  aerosols  in  the  atmosphere 

To  map  the  topography  of  land  surfaces 

To  measure  roughness,  reflectivity,  vegetation  heights, 

snow-cover,  and  sea-ice  surface  characteristics. 

Our  efforts  have  concentrated  on  the  GLAS  receiver  component  development,  the  Laser 
Reference  Sensor  for  the  Stellar  Reference  System,  the  GLAS  fiber  optics  subsystems,  and  the 
prelaunch  calibration  facilities.  We  will  report  on  our  efforts  in  the  development  of  the  space 
qualified  interference  filter  [Allan],  etalon  filter,  photon  counting  detectors,  etalon/laser  tracking 
system,  and  instrument  fiber  optics,  as  well  as  specification  and  selection  of  the  star  tracker  and 
development  of  the  calibration  test  bed. 

We  are  also  engaged  in  development  work  on  lidar  sounders  for  chemical  species.  We  are 
developing  new  lidar  technology  to  enable  a  new  class  of  miniature  lidar  instruments  that  are 
compatible  with  small  Discovery-class  orbiters  now  in  the  NASA  planetary  program.  The 
purpose  of  the  lidar  is  to  continuously  profile  the  water  vapor  and  dust  in  the  Mars  atmosphere 
from  orbit  in  order  to  quantify  its  dynamics,  their  relationship  in  the  diurnal  cycles,  and  to  infer 
water  vapor  exchange  with  the  Mars  surface.  To  remotely  measure  the  water-vapor  height 
profiles,  we  will  use  the  differential  absorption  lidar  (DIAL)  technique.  We  are  also  developing  a 
laser  sensor  for  measuring  the  total  column  content  of  C02  in  the  atmosphere  of  the  earth.  C02  is 
the  principal  greenhouse  gas  and  has  increased  by  roughly  80  ppm  in  the  last  century  and  a  half. 

We  will  report  our  efforts  in  the  development  of  the  laser  transmitter  and  photon  counting 
detector  components  for  a  Mars  Orbiting  DIAL  system  and  for  the  C02  sounder. 
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The  Branch  is  also  involved  in  a  number  of  smaller  technological  efforts  aimed  at  furthering  our 
capability  to  make  laser  based  measurements  from  space.  One  of  these  is  an  effort  in  FM 
spectroscopy  for  remote  sensing  of  chemical  constituents.  The  standard  approach  for  constituent 
measurement  has  been  Differential  Absorption  Lidar.  Here  two  wavelengths  of  light  are  used  to 
probe  on  and  off  the  absorption  line  of  the  species  of  interest.  The  concentration  of  the  species  of 
interest  is  determined  from  the  small  difference  in  the  returns  at  the  two  wavelengths.  Remote 
FM  spectroscopy  promises  to  increase  the  sensitivity  by  2  or  3  orders  of  magnitude.  A  laser  is 
modulated  and  the  resulting  sidebands  used  to  probe  the  absorption  line.  Heterodyne  techniques 
can  then  be  used  to  sensitively  detect  the  return.  Our  group  is  conducting  experiments  on  using 
this  technique  in  a  remote  sensing,  range  resolved  mode  to  get  constituent  profiles  vs.  distance 
from  airplane  and  space  based  platforms. 

Another  small  effort  is  underway  to  develop  a  device  called  a  recirculating  optical  frequency 
shifter.  This  project  was  conceived  as  a  way  to  calibrate,  on  the  ground,  a  wind-speed-measuring 
Doppler  lidar  instrument  intended  for  use  from  orbit.  The  Doppler  shift  in  the  return  light  due  to 
orbital  velocity  is  at  least  an  order  of  magnitude  greater  than  what  can  be  achieved  by  a  single 
pass  through  an  acousto-optic  frequency  shifter.  In  this  experiment,  a  light  pulse  is  circulated 
several  times  through  a  fiber-optic  loop  that  contains  an  acousto-optic  frequency  shifter  and 
optical  amplifier.  A  large  frequency  shift  is  built  up  before  the  pulse  is  switched  out  of  the  loop. 

Finally  a  number  of  other  still  smaller  efforts  are  underway  to  improve  the  efficiency  of 
transmission  through  a  Fabry-Perot  interferometer  and  to  reduce  the  effects  of  signal  induced 
noise  in  photomultiplier  tubes.  These  will  be  discussed  if  progress  and  time  permits. 


References: 

“Narrow  Pass-Band  Optical  filters  for  Space-Borne  Remote  Sensing  Applications”.  G.  R.  Allan, 
M.  A.  Krainak  and  M.  A.  Stephen.  Paper  RWC9.  OS  A  Topical  Meeting  on  the  Remote  Sensing 
of  the  Atmosphere.  June  1999. 
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ThD5.1 

3:30pm-4:00pm 

Nonlinear  Optical  Properties  and  Morphologies  of  VOPc  Thin  Film 

Prepared  on  Polyimide  Film 

H.Nakano*,  J.  Ito*,  H.  Furuhashi*,  A.  Maeda*,  T.  Yoshikawa*,  Y.  Uchida*, 
K.Kojima*,  A.  Ohashi*,  °S.  Ochiai*  and  T.  Mizutani** 

*Dept.  of  Electrical  Eng.,  Aichi  Inst.  Tech.,  Toyota  470-0392,  JAPAN 
**Dept.  of  Electrical  Eng.,  Nagoya  Univ.,  Nagoya  464-8603,  JAPAN 
Tel:  +81-565-48-8121,  Fax:  +81-565-48-0010,  e-mail:  ochiai@ee.aitech.ac.jp 
Corresponding  Author:  Shizuyasu  Ochiai 

If  the  third  harmonic  phase  matching  condition  of  a  nonlinear  optical  material  is  satisfied,  the 
efficiency  of  third  harmonic  wave  is  improved,  and  the  TH  strength  increases  in  proportion  to 
the  square  of  the  film  thickness.  Therefore,  the  phase  matching  condition  was  examined 
from  the  relation  of  the  TH  strength  vs.  the  film  thickness.  Symbols  used  are  Ts:  substrate 
temperature,  t:  evaporating  time  and  d:  film  thickness.  Figure  1  shows  the  UV/VIS  spectra 
of  samples  1,  2  and  3.  They  show  a  shoulder  around  680nm,  an  absorption  peak  at  740nm 
and  a  shoulder  around  820nm  in  the  Q  band  region.  It  suggests  that  these  films  have  Phase  I 
and  II.  In  UV/VIS  spectra  of  samples  1  and  3,  the  absorbances  at  680nm  and  740nm 
increase  with  the  film  thickness.  It  suggests  that  sample  3  is  dominantly  formed  with  the 
Phase  I.  Figure  2  shows  the  incidental  angle  dependence  of  TH  strength  for  samples  1,  2 
and  3  irradiated  with  P  polarized  laser  beam.  The  TH  strength  saturates  with  the  film 
thickness.  It  suggests  that  the  TH  strength  strongly  depends  upon  the  morphology  of  VOPc 
thin  film  prepared  on  polyimide 
(PI)  film.  Figure  3  shows  the  UV/VIS 
spectra  of  samples  4,  5  and  6.  Three 
samples  have  an  absorption  peak  at 
840nm  in  the  Q  band  region.  It 
suggests  the  transition  from 
Phase  I  to  Phase  III  in  samples  4, 

5  and  6.  Figure  4  shows  the 
incident  angle  dependence  of  TH 
strength  for  samples  4,  5  and  6 
irradiated  with  P  polarized  laser  beam. 

The  TH  strength  increases  with 
the  film  thickness. 

It  means  that  the  packing  density 
of  the  Phase  III  is  higher  than 
that  of  Phase  I.  Figure  5 
shows  the  TH  strength  vs.  film 
thickness  characteristics.  Although 
a  VOPc  thin  film  dominated  with 
Phase  III  satisfies  a  Phase  matching 
condition,  a  VOPc  thin  film  of  Phase  I 
does  not. 


Figurel  UV/VIS  spectra  of  samples  1,  2  and  3. 
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Angle  of  incidence [deg. ] 


Figure  2  Incidental  angle  dependence  of  Figure. 3  UV/VIS  spectra  of  samples  4,  5 

TH  strength  for  Samples  1,  2  and  3  irradiated  and  6. 
with  P  polarized  laser  beam. 


Figure  4  Incidental  angle  dependence  Figure  5  TH  intensity  vs.  film  thickness, 

of  TH  strength  for  Samples  4,  5  and  6 
irradiated  with  P  polarized  laser  beam. 
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WC1.1  (Invited) 

1:30pm-2:00pm 

Economic  and  Architectural  Benefits  of  Hierarchical  Backbone  Networks 

Jane  M.  Simmons 

Corvis  Corporation,  7015  Albert  Einstein  Drive,  Columbia,  MD  21046-9400 

jsimmons@corvis.com 

The  explosive  growth  in  data  traffic  will  necessitate  that  service  providers  deploy  networks  with 
hierarchical  logical  structures.  In  a  hierarchical  network,  each  layer  is  designed  to  hide  the  details  of  its 
layer  from  those  above,  thus  enabling  cost  effective  and  scalable  network  growth.  At  each  layer,  the 
appropriate  traffic  granularity,  switching  and  grooming  technology,  and  both  physical  and  logical 
topologies  must  be  selected.  Significant  cost  benefits  can  be  realized  in  both  capital  and  operational 
expenditures.  Multi-tiered  architectures  can:  take  advantage  of  cost-effective  banded  switching 
technologies  as  described  in  [1];  enable  a  large  amount  of  optical  bypass,  which  reduces  the  amount  of 
required  electronic  termination  equipment;  and  reduce  the  number  of  elements  that  need  to  be  managed, 
thereby  simplifying  network  operation.  Also,  with  respect  to  restoration,  a  hierarchical  scheme  provides 
efficient  use  of  restoration  capacity,  reduces  the  amount  of  electronic  access  equipment  that  needs  to  be 
deployed  for  the  capacity  to  be  effectively  shared,  and  simplifies  the  restoration  decision  process  [1], 

A  good  example  of  a  hierarchical  architecture  is  that  of  the  two-tiered  scheme  shown  in  Figure  1 .  In  this 
architecture,  the  upper  layer  is  comprised  of  localized  collector  rings,  operating  at  granularities  of  DS-3  or 
OC-3,  whereas  the  lower  layer  consists  of  a  nationwide  ‘express  mesh’  operating  at  granularities  of 
OC-48,  OC-192,  and  higher.  Each  collector  ring  is  typically  populated  with  SONET  Add/Drop 
Multiplexers  (ADMs),  with  all  nodes  in  the  ring  being  logically  connected  (although  not  necessarily  on 
all  wavelengths).  The  functions  of  the  collector  ring  are  two-fold:  route  traffic  within  the  ring,  and  collect 
inter-ring  traffic  and  deliver  it  to  the  large  mesh  nodes  located  on  that  ring.  The  ring  topology  is  suitable 
for  this  layer  due  to  the  limited  geographic  extent  of  the  collector  rings  and  the  simple  protection 
properties  of  rings.  Furthermore,  many  carriers  already  have  made  huge  investments  in  SONET-ring- 
based  networks.  The  capacity  of  these  rings  is  currently  being  exhausted  by  the  tremendous  growth  in 
traffic,  exacerbated  by  the  inefficiencies  of  inter-ring  routing.  These  legacy  rings  can  be  more  effectively 
utilized  with  their  role  restricted  to  serving  as  the  collector  network  -  SONET  ADMs  are  well  suited  for 
adding,  dropping,  and  grooming  traffic,  and  current  capacity  demands  will  be  lessened  by  migrating  the 
inter-ring  traffic  to  an  express  mesh.  Alternatively,  a  mesh  architecture  with  grooming  cross-connects  can 
be  used  for  the  collector  network. 

As  opposed  to  the  fine  granularity,  limited  extent  nature  of  the  collector  network,  the  lower-layer  express 
mesh  of  Figure  1  must  carry  large  amounts  of  traffic  for  long  distances.  These  requirements  are  best  met 
with  emerging  ultra-long  reach,  high-capacity  transport  systems  and  all-optical  switches.  Since  the 
collector  rings  perform  the  function  of  aggregation,  the  express  mesh  can  carry  traffic  at  coarser 
granularities  -  OC48  and  above.  Not  all  network  nodes  are  included  in  the  logical  mesh;  only  those 
nodes  that  generate  traffic  at  levels  comparable  to  the  granularity  of  this  level,  or  those  that  are 
strategically  located,  should  be  included.  Thus,  depending  on  traffic  patterns  in  the  network,  the  express 
mesh  will  completely  bypass  a  major  portion  of  the  nodes.  To  take  full  advantage  of  this  architecture, 
ultra-long  reach  transport  must  be  deployed  such  that  traffic  can  be  routed  between  (and  through)  lower- 
layer  nodes  without  any  intermediary  regeneration.  The  combination  of  a  sparsely  populated  express 
mesh  (roughly  20  nodes  nationwide)  with  ultra-long  reach  transport  eliminates  a  significant  portion  of 
electronic  termination  in  the  network  [2], 

In  addition  to  limiting  the  nodes  that  are  included  in  the  lower  layer,  it  is  also  important  to  create  a 
‘streamlined’  physical  topology.  Thus,  fiber  links  that  do  not  pass  by  major  nodes  may  not  be  included  in 
this  layer.  First,  not  populating  a  fiber  route  eliminates  the  capital  and  operational  costs  of  deploying 
amplifiers  on  that  route.  Second,  streamlining  the  underlying  topology  results  in  larger  bundles  of  traffic 
that  can  be  routed  at  lower  cost;  i.e.,  restricting  the  diversity  of  routes  produces  more  efficiently  packed 
traffic  pipes.  While  removing  fiber  routes  results  in  greater  traffic  on  the  remaining  routes,  next- 
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generation  high-capacity  transport  systems  can  accommodate  such  high  bandwidth  demands.  In  addition, 
while  fewer  fiber  paths  may  result  in  somewhat  longer  end-to-end  paths  for  some  connections,  this  very 
likely  does  not  lead  to  more  regeneration  if  ultra-long  reach  systems  are  deployed.  Of  course,  restoration 
options  must  also  be  considered  when  laying  out  the  physical  topology. 

A  mesh  topology  is  most  appropriate  for  the  lower  level  since,  through  the  use  of  switches,  it  creates 
virtual  topologies  that  can  be  optimized  for  the  routing  of  working  traffic  and  the  sharing  of  protection 
bandwidth.  One  of  the  most  significant  benefits  of  a  hierarchical  architecture,  in  terms  of  cost  and 
scalability,  is  that  it  enables  the  deployment  of  a  hierarchical  switch  architecture,  as  shown  in  Figure  2. 
The  streamlined  architecture  of  an  express  mesh  creates  large  bundles  of  traffic  that  can  be  switched  as  a 
single  unit.  For  example,  increasing  the  switching  granularity  by  a  factor  of  B  decreases  the  switch  fabric 
size  by  a  factor  of  somewhere  between  BlogB  and  B2,  depending  on  the  switch  technology.  This  directly 
translates  to  a  saving  in  switch  cost,  switch  size,  number  of  physical  interfaces,  power,  and  switch  states 
that  must  be  managed.  In  Figure  2,  we  assume  there  are  two  levels  of  switches,  operating  at  granularities 
of  Gi  and  G2,  where  Gi  is  coarser  than  G2.  With  efficient  packing  algorithms,  only  a  small  portion  of  the 
traffic  needs  to  be  switched  at  the  finer  granularity;  the  remainder  is  more  cost  effectively  switched  at  the 
coarser  level.  While  the  switches  can  operate  in  the  optical  or  electrical  domain,  to  best  take  advantage  of 
an  express  mesh,  optical  switches  should  be  used  for  the  coarse  granularity  traffic.  This  enables  large 
bundles  of  traffic  to  express  through  the  node  without  any  electronic  termination. 

Thus  far,  we  have  discussed  a  two-tiered  approach  to  network  design.  Ultimately,  what  will  drive  the 
number  of  layers  and  the  granularity  of  each  layer  is  cost  and  technology.  Assume  that  the  cross-sectional 
traffic  on  any  link  in  a  particular  layer  of  the  network  is  T,  and  assume  that  the  finest  level  of  granularity 
required  at  that  layer  is  G.  Then  the  number  of  elements  per  link  that  are  potentially  switched  are  T/G. 
At  a  node  of  degree-N,  this  translates  to  a  switch  size  of  approximately  (NT/G)x(NT/G).  Due  to  limits  of 
technology  and  physical  space,  the  switch  size  may  be  limited,  resulting  in  multiple  tiers,  where,  for  the  ith 
tier,  Tj  and  G,  are  chosen  such  that  NTj/Gj  remains  tractable.  This  will  enable  networks  to  continue  to 
grow  gracefully. 


We  have  presented  a  qualitative  argument  for  the  economic  and  architectural  benefits  of  deploying 
hierarchical  networks,  and  outlined  how  these  benefits  can  be  best  realized  with  new  generation  transport 
and  switching  systems. 


Figure  1.  An  example  of  a  two-level  hierarchy,  where 
a  group  of  collector  rings  aggregates  and  grooms  fine 
granularity  traffic,  and  delivers  traffic  to  a  streamlined, 
coarse  granularity  express  mesh. 


Adds  Drops 


Figure  2.  A  hierarchical  architecture  can  take 
advantage  of  hierarchical  switching.  At  the  lower  level, 
an  OXC  switches  traffic  at  a  coarse  granularity  (GI). 
Only  a  small  percentage  of  the  traffic  needs  to  be 
switched  at  the  finer  granularity  (G2). 


References: 

[1]  Simmons,  J.M.,  “Hierarchical  Restoration  in  a  Backbone  Network,”  OFC’99,  Feb.  21-26,  1999,  San  Diego,  CA. 

[2]  Saleh,  A.A.M.,  “Transparent  Optical  Networking  in  Backbone  Networks,”  OFC’OO,  Mar.  7-10,  2000,  Baltimore, 
MD. 


4 


WC1.2  (Invited) 
2:00pm-2:30pm 


Digital  Wrappers  for  IP  over  WDM  systems 


Paul  Bonenfant*  and  Antonio  Rodriguez  Moral 
Lucent  Technologies 

pbonenfan  t@lucent.  com,  arodmor@lucent.  com 

*101  Crawfords  Comer  Road,  Room  3C-515 
Holmdel,  NJ  07733 
Tel:  +1  732  949  8760 
Fax:  +1  732  949  1124 


Summary 

There  is  considerable  interest  within  the  rapidly  converging  telecommunications  and  data 
communications  networking  environments  around  a  simplification  of  the  data/transport  network  known 
as  “IP  over  WDM.”  The  ubiquity  and  critical  role  of  core  LP  routers,  and  the  emergence  of  Label 
Switching  Routers  (LSRs)  based  on  Multi-Protocol  Label  Switching  (MPLS)  technology,  when 
combined  with  Optical  Transport  Networks  (OTNs)  has  also  raised  the  issue  of  optimized  network 
architectures  for  the  Optical  Internet. 

In  marketing  terms,  the  notion  of  “IP  over  WDM”  makes  for  remarkable  hype:  take  the  most  widely 
talked  about  data  networking  protocol — IP — and  combine  it  with  the  ultimate  in  bandwidth 
provisioning — WDM — to  realize  the  ultimate  “blend”  of  hot  technologies.  What  is  meant  by  “IP  over 
WDM,”  however,  is  less  than  clear.  The  intention  of  this  paper  is  to  demystify  the  notion  of  “IP  over 
WDM,”  and  in  doing  so,  to  indicate  what  features  may  be  required  in  future  “collapsed”  data/transport 
networks. 


In  reality,  “IP  over  WDM”  invariably  implies  one  of  a  multiplicity  of  mappings  of  IP  onto  fiber  (or 
wavelengths),  as  illustrated  in  Figure  1.  To  state  that  any  one  particular  mapping  represents  “IP  over 
WDM”  is  extremely  disingenuous,  and  ignores  the  fact  that  every  data  network  is  unique  in  a 
marketplace  governed  by  differentiation. 
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Figure  1.  Variations  on  a  Theme:  What  is  “IP  over  WDM”? 
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Commensurate  with  the  emergence  of  “IP  over  WDM”  protocol  mappings  is  the  demand  for  transport 
networking  at  unprecedented  levels  of  granularity — on  the  order  of  Gigabits  to  tens  of  Gigabits  per 
second — and  the  evolution  from  time-division  multiplexing  (TDM)  to  Optical  Transport  Networking  via 
WDM.  We  discuss  the  move  from  TDM  to  Optical  Channel  networking  as  a  natural  step  in  the  evolution 
of  the  transport  network,  which  will  be  predicated  upon  the  support  of  operations,  administration,  and 
maintenance  functions  at  the  Optical  Channel  layer. 

Current  deployments  of  IP  networks  presuppose  that  delivery  will  be  on  a  “best  effort”  basis.  However, 
many  network  operators  want  to  provide  new  services  that  require  traffic  management  of  IP  flows.  One 
of  the  most  significant  of  these  is  virtual  private  networking,  where  one  customer’s  traffic  is  separated 
from  another  in  a  single  IP  network  with  well-defined  quality  of  service  and  security.  While  this  is  not 
possible  with  today’s  “best  effort”  IP  network,  it  can  be  achieved  by  means  of  a  protocol  that  allows  IP 
packets  to  be  partitioned  into  traffic  flows  that  can  be  manipulated  in  their  own  right.  Examples  of  such 
protocols  include  ATM,  and  MPLS.  The  partitioned  flows  must  then  be  encapsulated  to  be  carried  over 
WDM.  From  the  perspective  of  IP,  it  is  of  no  consequence  if  IP  is  partitioned  or  not,  only  that  it  is 
encapsulated. 

The  management  of  optical  networks  will  also  undergo  a  profound  evolution.  The  emergence  of  digital 
wrapper  technology  will  enable  optical  networks  to  provide  similar  management  features  to  those 
existing  today  in  Synchronous  Optical  Network  (SONET)  and  Synchronous  Digital  Hierarchy  (SDH) 
networks,  while  at  the  same  time  opening  the  optical  layer  to  a  wide  variety  of  client  signals.  This  will 
allow  service  providers  to  accommodate  different  protocol  stacks  and  network  architectures  for  their 
next-generation  IP  backbones.  This  feature,  known  as  service  transparency,  together  with  the  required 
technologies  and  protocols  for  optical  switching,  protection  and  restoration,  are  critical  in  a  marketplace 
that  will  be  governed  by  service  differentiation  and  rapid  service  provisioning.  We  present  a  perspective 
on  the  impact  of  technologies  such  as  digital  wrappers,  optical  switching  and  optical  layer  protection  and 
restoration,  on  the  services  and  capabilities  of  next-generation  optical  transport  networks  and  optical 
internetworks. 

Another  issue  under  intense  research  and  debate  is  the  design  of  the  control  plane  of  next-generation 
OTNs  and  optical  internetworks.  The  concept  of  Multi-Protocol  Lambda  Switching  introduced  in  the 
Internet  Engineering  Task  Force  (IETF)  has  emerged  as  a  potential  solution,  but  it  has  also  added  a 
degree  of  confusion  to  the  problem.  We  introduce  some  of  the  concepts  and  protocols  that  are  under 
consideration  in  the  IETF  and  other  fora,  including  the  Optical  Internetworking  Forum  (OIF),  the  Optical 
Domain  Service  Interconnect  (ODSI)  initiative,  and  the  Amercian  National  Standards  Institute 
Committee  T1  -  Telecommunications  (ANSI  Tl)  and  International  Telecommunications  Union  (ITU) 
standards  bodies.  The  goal  of  these  initiatives  is  to  enable  rich  services  such  as  rapid  optical  channel 
provisioning,  protection  and  restoration,  optical  layer  network  topology  auto-discovery,  and  optical 
bandwidth  management  services. 

Finally,  we  propose  that  true  optical  internetworks  will  not  result  from  a  blind  simplification  of  the 
protocol  stack,  but  rather  from  a  well-engineered  combination  of  network  architectures,  protocol  stacks, 
service  transparency,  and  enhanced  optical  layer  management.  Enabled  by  the  use  of  digital  wrappers  in 
the  OTN,  together  with  an  innovative  control  plane  that  reuses  and  adapts  many  existing  protocols  from 
the  IP  world,  the  Optical  Internet  will  provide  feature-rich  services  that  have  yet  to  be  implemented  in 
legacy  data  networks  and  traditional  WDM-based  optical  transport  networks. 
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Traffic  engineering  in  IP/ WDM  networks  can  be  pursued  in  two  fashions:  overlay  and  integrated  as 
shown  in  the  figure  below.  With  overlay  traffic  engineering,  operations  in  one  layer  can  be  independent 
from  those  in  the  other  layer.  Traffic  engineering  solutions  developed  for  either  IP  networks  or  WDM 
networks  can  be  directly  applied  to  each  layer,  respectively.  At  the  present  stage,  an  anticipated  first  step 
towards  IP  over  WDM  is  to  hook  up  IP  router  interfaces  to  OADM  (optical  add/drop  multiplexer)  client 
add/drop  ports.  This  forms  an  overlay  network  in  which  WDM  serves  as  the  server  layer  and  IP  behaves 
like  the  client  layer.  This  overlay 
client-server  network  is  a  natural 
match  to  the  notion  of  overlay  traffic 
engineering.  With  integrated  traffic 
engineering,  optimization  with  respect 
to  selected  objectives  is  pursued 
coordinately  across  both  the  IP  and 
WDM  layers.  It  is  made  possible  by 
mechanisms  such  as  Multi-Protocol 
Lambda  Switching  [1]  that  allows  a 
unified  control  plane  across  both 
layers.  With  the  emergence  of  more 
sophisticated  hardware  that  integrates 
functionality  of  both  IP  and  WDM  at 
each  network  element  (NE),  integrated 
traffic  engineering  can  be  performed 
more  naturally  and  efficiently. 

IP/WDM  Traffic  Engineering  Paradigms:  With  overlay  traffic  engineering,  operations  in  one  layer 
are  pursued  independent  from  those  in  the  other  layer.  Traffic  engineering  solutions  developed  for  either 
IP  networks  or  WDM  networks  are  applied  directly  to  their  respective  layer,  with  little  or  no  inter-layer 
coordination.  In  IP/WDM  networks,  traffic  engineering  in  the  IP  layer  can  theoretically  be  effected  via  IP 
routing  algorithms  that  can  adapt  the  IP  packet  routes,  and  perform  load  balancing.  Likewise,  traffic 
engineering  in  the  WDM  layer  can  be  effected  through  light  path  reconfiguration  that  adapts  the  IP 
network’s  virtual  topology  to  the  evolving  traffic  pattern.  In  the  case  of  switched  WDM,  the  WDM  layer 
can  also  adjust  the  number  of  wavelengths  allocated  to  each  virtual  link,  thereby  affecting  the  bandwidth 
and  contention  statistics  of  the  virtual  link.  In  practice,  because  traditional  IP  routing  algorithms  are  all 
oblivious  to  traffic  loading,  traffic  engineering  in  today’s  IP  networks  often  relies  entirely  on  link  layer 
adjustments.  For  example,  an  IP  over  ATM  network  typically  maintain  a  full  IP  mesh  connectivity,  and 
rely  entirely  on  the  ATM  layer  to  adjust  the  bandwidths  provisioned  for  the  ATM  virtual  circuits  that 
support  the  IP  links.  These  ATM  layer  actions  are  completely  transparent  to  the  IP  layer  routing 
algorithm.  Overlay  traffic  engineering  is  most  likely  adopted  in  near-term  deployment  of  overlay 
IP/WDM  networks. 

With  the  emergence  of  MPLS-based  IP  routers,  and  MP^S-based  WDM  OXCs,  it  is  now  possible  to 
define  an  integrated  control  plane  for  both  layers,  which  makes  it  easier  to  pursue  a  more  integrated 
approach.  With  integrated  traffic  engineering,  optimization  with  respect  to  selected  objectives  is  pursued 
coordinately  across  both  the  IP  and  the  WDM  layers.  All  of  the  IP  and  WDM  traffic  engineering 
mechanisms  described  above  still  exist,  and  can  now  be  considered  together.  Time  scale  and  traffic 
granularity  considerations  will  determine  which  specific  mechanisms  from  the  different  layers  are  most 
suitable  for  different  traffic  conditions.  With  more  sophisticated  hardware  that  integrates  functionality  of 
both  IP  and  WDM  at  each  network  element  (NE),  integrated  traffic  engineering  can  be  performed  even 
more  naturally  and  efficiently.  Emerging  technologies  such  as  Optical  Label  Switching  (OLS)  [2]  will 
further  enrich  the  set  of  available  traffic  engineering  choices. 
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Traffic  Engineering  Functional  Components:  Regardless  of  which  traffic  engineering  approach  is 
adopted,  an  IP/WDM  traffic  engineering  solution  will  always  consist  of  a  number  of  basic  functional 
components.  We  enumerate  the  most  important  components  below: 

Traffic  monitoring,  analysis  and  aggregation  is  responsible  for  collecting  traffic  statistics  from  the 
network  elements,  e.g.  IP  routers  and  OXC.  Then  the  statistics  are  analyzed  and/or  aggregated  to  prepare 
for  the  traffic  engineering  and  network  reconfiguration  related  decision-making. 

Bandwidth  demand  projection  projects  the  bandwidth  requirements  in  the  near  future  based  on  past  and 
present  measurements  and  the  characteristics  of  the  traffic  arrival  processes.  The  bandwidth  projections 
are  used  for  subsequent  bandwidth  allocation. 

Reconfiguration  trigger  consists  of  a  set  of  policies  that  decide  when  a  network  level  reconfiguration  is 
performed.  This  is  based  on  traffic  measurements,  bandwidth  predictions,  and  on  operational  issues,  e.g., 
to  suppress  influence  of  transitional  factors  and  reserve  adequate  time  for  network  to  converge. 

Topology  design  provides  a  network  topology  based  on  the  traffic  measurements  and  predictions. 
Conceptually  this  can  be  considered  as  optimizing  a  graph  (i.e.  IP  routers  connected  by  light  paths  in  the 
WDM  layer)  for  specific  objectives  (e.g.  maximizing  throughput),  subject  to  certain  constraints  (e.g. 
nodal  degree,  interface  capacity),  for  a  given  load  matrix  (i.e.  traffic  load  applied  to  the  network.)  This  is 
in  general  a  NP-hard  problem.  Since  reconfiguration  is  regularly  triggered  by  continually  changing  traffic 
patterns,  an  optimized  solution  may  not  be  stable.  It  may  be  more  practical  to  develop  heuristics  that 
emphasize  more  on  factors  like  fast  convergence,  and  less  impacts  on  ongoing  traffic,  than  on  optimality. 
Topology  migration  consists  of  algorithms  to  coordinate  the  network  migration  from  an  old  topology  to  a 
new  topology.  As  WDM  reconfiguration  deals  with  large-capacity  channels,  changing  allocation  of 
channel  resources  in  this  coarse  granularity  has  significant  effects  on  a  large  number  of  end  user  flows. 
Traffic  flows  have  to  adapt  to  the  light  path  changes  at  and  after  each  migration  step.  These  effects  can 
potentially  spread  over  the  routing  pattern  of  the  network,  which  in  turn  may  affect  more  user  flows. 

Simulation  Results:  To  gauge  the  benefits 
possible  in  IP/WDM  traffic  engineering,  we 
performed  extensive  simulation  studies.  The 
topology  used  in  the  simulation  is  derived 
from  a  major  carrier’s  IP  backbone  covering 
the  United  States.  The  simulation’s  offered 
traffic  loads  captured  typical  Internet  traffic 
characteristics,  such  as  sudden  surge  between 
some  node  pairs,  spatially  unbalanced  loading 
and  regional  congestions.  The  different  loads 
emulate  short-term  hot-spot  problems,  and 
long  term  unbalanced  resource  usage  due  to  poor  planning.  Simulation  results  show  that  network 

performance  is  improved  in  many  ways:  overall  IP 
traffic  throughput  is  increased  when  the  traffic  pattern 
is  skewed;  the  congestion-free  region  is  expanded  by 
the  same  scale  (top  right  figure,)  network  resiliency  is 
also  enhanced.  The  simulation  also  shows  that  network 
wide  average  packet  end-to-end  latency  improved  in 
all  load  regions  (middle  left  figure.)  These  delay 
improvements  are  the  results  of  more  efficient  network 
resource  allocation,  which  migrates  loads  on  congested 
links  to  less  utilized  ones  (right  bottom  figure.) 
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1.  Introduction 

DWDM  point-to-point  transport  systems  have  been  widely  deployed  in  the  backbone  networks.  Recently  research 
and  development  have  been  focusing  on  developing  the  reconfigurable  optical  networking  systems  and  technologies 
that  would  possess  network  intelligence  to  directly  accommodate  IP,  ATM  as  well  as  traditional  voice  traffic.  In  this 
paper  we  report  the  successful  transport  of  Gigabit  Ethernet  for  1062  km  over  the  multi-vendor,  reconfigurable, 
MONET  Washington  DC  network. 

Tellium  Lucent 


Fig.  1.  MONET  Washington  DC  network. 

2.  Network  Architecture 

The  MONET  Washington  DC  Network  consists  of  two  two-fiber  rings  as  shown  in  figure  1.  The  east  ring  is 
provided  and  maintained  by  Lucent  Technologies.  This  ring  consists  of  two  wavelength  selective  cross-connects 
(WSXCs)  in  the  Laboratory  for  Telecommunication  Sciences  at  National  Security  Agency  (NSA/LTS)  and  the 
Navel  Research  Laboratory  (NRL),  one  wavelength  add/drop  multiplexer  (WADM)  at  National  Aeronautics  and 
Space  Administration  (NASA),  and  two  wavelength  amplifiers  (WAMPs)  in  the  Bell  Atlantic  central  office.  The 
west  ring  is  provided  by  Tellium  and  managed  by  Telcordia  Technologies.  This  ring  consists  of  three  WADMs  at 
the  Defense  Advanced  Research  Projects  Agency/Information  Systems  Institute  (DARPA/ISI),  Defense  Information 
Systems  Agency  (DISA),  and  Defense  Intelligence  Agency  (DIA)  as  well  as  four  WAMPs  in  the  Bell  Atlantic 
central  offices.  Bell  Atlantic  provides  the  in-ground  fiber  infrastructure  for  the  network.  The  whole  network  is 
controlled  and  managed  by  Telcordia  Technologies’  CORBA  based  network  control  and  management  (NC&M) 
system  which  runs  on  the  ATM  network  at  OC-3c  rate  over  the  1510  nm  optical  supervisory  channel. 

The  switching  fabric  of  Lucent  network  elements  is  based  on  lithium  niobate  technology.  It  is  therefore 
transparent.  The  Tellium  WADM  is  based  on  wavelength  transponders  and  transceivers  of  OEO  conversion  and 
electrical  switching  fabric.  Thus  the  west  ring  is  opaque  and  has  the  wavelength  interchange  capability. 

3.  Gigabit  Ethernet 

The  Tellium  west  ring  has  4  modes  of  transmission:  OC-3,  OC-12,  OC-48,  and  OS1G  (optical  signal).  The  OSIG 
mode  is  to  transmit  non-SONET  rate  signals  up  to  2.5  Gbps  without  timing  recovery  during  the  OEO  conversion. 

We  have  conducted  an  experiment  to  test  the  long  distance  transmission  of  Gigabit  Ethernet  (GbE)  using  the 
OSIG  mode.  The  equipment  used  in  the  experiment  were:  two  Pentium  II  workstations  as  test  hosts  running 
FreeBSD  3.2  with  Netgear  GA620  GbE  Cards  with  850  nm  SX  ports;  these  two  hosts  were  connected  to  an  Extreme 
Networks  Summit  1  GbE  switch.  The  switch  was  configured  into  two  virtual  LAN  (VLAN),  with  the  test  hosts  each 
residing  on  a  separate  VLAN.  This  configuration  prevented  the  switch  fabric  from  bridging  packets  between  the  two 
hosts  directly.  Instead,  packets  were  forced  over  the  switch  LX  GbE  ports  (1310  nm).  The  link  from  the  host  named 
‘bug’  to  the  host  named  ‘short’  was  under  testing.  The  received  optical  power  was  attenuated.  The  experiment  was 
to  measure  the  packet  loss  rate  or  frame  loss  rate  versus  the  received  power. 

The  first  step  was  to  measure  a  back-to-back  baseline  and  a  network  baseline.  The  network  baseline  was 
obtained  by  configuring  the  WADM  that  the  GbE  packets  traveled  through  the  WADM  switch  fabric  only.  We  then 
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sent  the  GbE  signals  out  of  the  node  over  the  fibers  towards  DISA,  DIA  and  all  the  way  around  the  ring  back  to 
DARPA/ISI  and  looped  back  at  each  site  with  distance  of  100  km,  238  km  and  412  km,  respectively. 


Fig.  2.  UDP  packet  loss  rate  vs.  received  power.  Fig.  3.  Time  jitter  vs.  transmission  distance. 

Multi-generator  (MGEN)  and  Test  TCP  (TTCP)  were  used  to  generate  frames  filled  with  UDP  packets  for  the 
test.  The  UDP  packets  were  1400  byte  in  size  and  each  packet  filled  in  one  frame.  The  frames  transmitted  and 
received  were  monitored  and  the  frame  loss  rates  or  the  packet  loss  rates  were  calculated. 

The  experimental  results  are  shown  in  figure  2.  The  power  penalty  for  the  network  baseline  over  the  back-to- 
back  baseline  is  about  0.5  dB  at  10%  packet  loss  rate.  The  power  penalties  for  100  km,  238  km  and  412  km 
transmissions  are  1.2,  2  and  2.5  dB.  The  maximum  received  power  for  the  Summit  1  switch  is  -3  dBm.  The  received 
power  for  412  km  transmission  at  less  than  1%  packet  loss  rate  is  -22  dBm.  This  gives  a  system  margin  of  20  dB, 
which  indicates  the  possibility  of  transmission  for  more  than  1000  km  over  the  wide  area  optical  network. 


Fig.  4.  Eye  diagrams  at  0,  412  and  824  km. 


Testing  one  switch  with  two  VLANs  might  not  show  any  time  synchronization  problems.  We  used  two  switches 
to  investigate  the  timing  issue.  We  did  not  observe  any  significant  performance  differences. 

In  order  to  achieve  long  distance  transmission,  we  connected  ‘short’  to  ‘bug’  directly  through  the  switching 
fabric.  And  we  used  X.3,X,4  and  for  the  link  from  ‘bug’  to  ‘  short’  to  have  a  total  of  1236  km  transmission  distance. 

We  suspected  that  the  limiting  factor  for  the  long  distance  transmission  of  the  GbE  was  the  time  jitter 
accumulation.  Thus  we  measured  the  peak-to-peak  time  jitter  from  the  eye  diagrams.  Figure  3  shows  the  time  jitter 
versus  the  transmission  distance  with  and  without  3R  regeneration.  The  dashed  curve  is  without  any  3R  regeneration 
and  the  maximum  distance  is  824  km  where  the  jitter  is  588  ps,  almost  reaching  the  GbE  standard  of  total  jitter 
budget  599  ps  [1].  Figure  4  shows  the  eye  diagrams  at  0,  412  and  824  km.  The  corresponding  eyes  experienced  2,  18 
and  36  2R  regeneration.  The  measurement  was  made  at  normal  operation  condition  with  -10  dBm  received  power. 

In  order  to  increase  the  transmission  distance,  we  built  a  3R  regenerator  to  overcome  the  time  jitter 
accumulation.  We  put  the  regenerator  at  412  km.  The  time  jitter  has  been  improved  from  478  ps  to  188  ps,  almost 
the  same  as  that  of  the  0  km  eye  which  is  141  ps.  We  increased  the  transmission  distance  to  reach  a  maximum  of 
1062  km  where  jitter  reaches  579  ps.  Again,  beyond  this  distance  the  link  was  down.  The  jitter  vs.  distance  is  shown 
as  the  solid  curve  in  Fig.  3.  We  also  put  the  3R  regenerator  at  650  km  and  824  km.  In  both  cases,  we  could  not  reach 
1062  km  distance.  It  seems  that  when  time  jitter  reaches  400  ps,  which  is  half  of  the  bit  period,  it  requires  3R 
regeneration;  additionally  when  the  time  jitter  reaches  599  ps,  no  data  can  be  transmitted. 

4.  Conclusions 

For  the  first  time,  to  our  knowledge,  we  have  transported  Gigabit  Ethernet  packets  directly  over  WDM  for  1062  km 
in  the  successfully  deployed  MONET  Washington  DC  network.  The  experiment  indicates  that  the  Gigabit  Ethernet 
can  be  used  for  WAN  or  backbone  transmission  bypassing  the  ATM  and  SONET  intermediate  protocol  layers.  It 
also  indicates  that  the  physical  layer  of  the  Fibre  Channel,  which  is  adopted  by  the  Gigabit  Ethernet,  can  be  used  as  a 
long-haul  transmission  protocol. 

[1]  IEEE  Standard  802.3,  1998  Edition. 
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Innovation  of  optical  technologies  for  telecommunications  has  acquired  an  extremely  rapid  pace  in  the  last 
year.  As  the  recent  OSA  Optical-Fiber-Communications  2000  Conference  exhibit  very  well  testify,  all  the  main 
components  necessary  for  switching  and  transmission  of  WDM  lightpaths  are  now  available  as  commercial 
products  on  the  market.  WDM  transmission-system  capacity  has  grown  towards  hundreds  of  WDM  channels, 
each  one  modulated  at  10  Gbit/s  or  more,  over  very  long  transmission  spans;  opto-electronic  regenerators  and 
wavelength  converters  operate  in  the  10  Gbit/s  range;  Optical  Add-Drop  Multiplexers  (OADMs)  are  already 
quite  common  devices,  while  several  Optical  Cross-Connect  (OXC)  products  have  been  recently  presented  or 
announced.  This  technological  evolution  makes  the  full  development  of  WDM  networking  a  reality:  new  complex 
meshed  topologies  can  be  developed  at  the  side  of  more  traditional  and  simpler  schemes  such  as  WDM  rings; 
wavelength  conversion  can  be  used  to  add  flexibility,  while  lightpath  control  algorithms  can  be  upgraded  to 
improve  the  efficiency  of  connection  provisioning. 

In  this  ever-changing  scenario  also  the  relationship  between  the  entities  that  generate  telecommunication 
traffic  and  the  entities  that  control  the  provisioning  of  transport  resources  is  evolving.  In  the  recent  past 
the  large-capacity  transport  networks  were  typically  operated  by  single  nation-wide  telephone  companies  that 
used  them  mainly  to  transport  the  traffic  generated  by  their  own  subscribers.  Today,  instead,  a  new  model  is 
becoming  more  and  more  frequent  in  which  an  optical  transport  network  operator  provides  capacity  on-demand 
in  terms  of  optical  connections  (lightpaths)  to  independent  customers  such  as  Internet  service  providers  or  the 
telephone  companies  themselves.  Thanks  to  the  transparency  of  the  optical  network,  once  a  customer  has  leased 
a  lightpath  he  is  able  to  manage  it  at  his  will  and  according  to  the  need  of  the  services  he  is  providing  to  its  own 
subscribers,  for  example  adopting  the  most  suitable  electronic  transport  technology  with  the  option  of  choosing 
among  SDH,  raw  IP  packets,  raw  ATM  cells,  raw  HDTV  data  and  so  on.  This  makes  the  WDM  layer  a  common 
platform  for  heterogeneous  customers  and  service  providers. 

The  evolution  we  depicted  implies  also  that  the  approach  to  WDM  network  analysis  and  design  may  change. 
In  the  past  decade  a  large  number  of  studies  have  been  carried  out  aimed  at  the  cost  optimization  of  a  WDM 
network  given  the  forecast  average  traffic-demand,  evaluated  in  advance  on  the  basis  of  the  previous  utilization 
history.  This  approach,  that  we  shall  call  static-traffic  WDM  design,  is  no  longer  easily  applicable  for  a  network 
operator  of  the  new  type  that  not  only  ignores  how  many  customers  it  will  have  in  a  given  time  in  future,  but 
also  what  kind  of  service  they  will  provide  to  their  subscribers.  How  is  it  possible  to  predict  the  values  of  a 
static  traffic-matrix  in  these  conditions?  Incidentally  it  must  be  considered  that  some  of  the  services  that  the 
optical  transparency  may  allow  to  support  may  well  be  entirely  new,  without  a  case-history  on  which  to  base 
statistical  predictions. 

A  more  effective  approach  to  WDM  network  planning  may  be  based  on  the  dynamic  traffic,  where  by  this 
term  we  mean  that  requests  for  lightpaths  can  arrive  to  the  network  at  any  random  time  instants,  when 
other  connections  are  already  active.  If  the  new  request  can  be  accommodated  without  disrupting  the  existing 
connections,  a  lightpath  is  created  and  it  is  maintained  active  for  the  whole  duration  of  the  connection  specified 
by  the  customer  that  asked  for  the  resources.  The  performance  of  the  network  in  this  scenario  can  be  well 
measured  by  the  blocking  probability  of  the  network,  i.e.  the  probability  that  a  new  potential  customer  can  not 
be  satisfied  due  to  shortage  of  transmission  resources. 

Optimal  design  criteria  in  these  conditions  have  been  up  to  now  quite  seldom  investigated  and  are  not  as 
known  as  in  the  static-traffic  case.  Useful  elements  to  optimize  complex  systems  always  come  from  the  analysis 
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of  their  behavior  in  various  conditions.  Therefore  in  the  present  contribution  we  would  like  to  summarize  the 
studies  we  have  carried  out  concerning  the  behavior  of  both  ring  and  mesh  WDM  networks  under  dynamic 
traffic. 

The  analyses  we  performed  are  all  based  on  a  model  of  the  network  which  allows  to  run  a  set  of  simulations 
in  which  the  events  of  lightpath-setup  and  lightpath-tear-down  request  are  generated  by  the  nodes  according 
to  suitably  defined  random  processes.  A  lightpath  setup  implies  solving  a  problem  of  Routing  and  Wavelength 
Assignment  (RWA)  for  the  new  connection,  according  to  a  certain  set  of  RWA  algorithms.  The  blocking  prob¬ 
ability  is  then  evaluated  by  gathering  the  statistical  data  from  the  simulation  run  in  steady-state  conditions. 
Another  measure  of  the  network  performance  can  be  provided  by  comparing  the  blocking  probability  of  con¬ 
nections  between  adjacent  nodes  and  between  very  distant  nodes.  This  measure  is  called  fairness:  a  network 
providing  a  good  quality  service  should  be  equally  fair  to  all  the  potential  customers  regardless  of  the  distance 
of  their  terminal  nodes. 

In  particular  we  intended  to  understand  the  effects  of  certain  network-design  approaches  and  system  para¬ 
meters  on  the  blocking  probability.  We  have  considered  the  cases  that  we  are  going  to  very  briefly  summarize 
in  the  following. 

WDM  ring  networks.  Wavelength  conversion  capacity  increases  the  flexibility  of  a  WDM  network  and 
its  capacity  to  accommodate  new  connections.  However  converters  are  expensive  devices:  we  considered 
therefore  the  solution  of  placing  converters  only  in  some  nodes  of  a  ring  WDM  network.  Both  the  unidi¬ 
rectional  and  bidirectional  rings  were  tested  and  converters  were  uniformly  distributed  among  the  nodes 
or  placed  in  adjacent  nodes  [1],  [2]. 

The  analyses  were  carried  out  employing  different  types  of  connection-request  generators  to  simulate  situ¬ 
ations  in  which  the  traffic  is  smooth,  regular  (Poissonian)  or  peaked.  This  was  done  in  order  to  take  into 
account  a  wide  range  of  traffic  types,  since  there  is  no  precise  idea  yet  of  how  to  statistically  characterize 
the  lightpath  traffic. 

Our  analysis  showed  that,  provided  wavelength  converters  are  evenly  distributed,  only  a  few  of  them  are 
sufficient  to  lower  the  blocking  probability  almost  to  the  same  level  reached  when  wavelength  converters  are 
placed  in  all  the  nodes.  We  also  showed  that  wavelength  conversion  capability  is  much  more  important  to 
obtain  a  fair  network  than  to  improve  blocking,  especially  in  the  unidirectional  ring.  Finally,  the  blocking 
probability  rises  with  peaked  traffic,  but  in  this  conditions  the  network  is  more  fair. 

WDM  mesh  networks.  The  topology  of  a  mesh  network  has  a  direct  impact  on  its  blocking  properties. 
The  analysis  we  carried  out  on  the  WDM  mesh  networks  [3]  was  therefore  mainly  aimed  at  the  understand¬ 
ing  of  the  relation  between  blocking  and  fairness  and  some  topological  parameters,  such  as  nodal  degree 
and  the  average  minimum  distance  between  the  nodes.  Moreover  we  compared  the  performance  attainable 
by  different  routing  and  wavelength  assignment  algorithms.  All  the  simulations  were  repeated  twice  to 
compare  the  cases  with  full  and  absent  wavelength  conversion  capability. 

The  results  obtained  show  that  not  only  the  blocking  probability  decreases  by  increasing  the  node  degree, 
as  expected,  but  also  decreasing  the  average  minimum  distance  between  the  nodes.  Moreover  we  could 
verify  that  the  best  performing  wavelength  assignment  and  routing  algorithms  are,  respectively,  First-Fit 
and  Least  Loaded  Routing.  Finally  we  verify  that  the  wavelength  conversion  capability  is  always  useful  to 
improve  the  fairness  of  the  network,  especially  in  those  networks  having  a  low  connectivity  degree:  this  is 
the  generalization  of  the  results  we  have  obtained  concerning  rings  to  any  topology. 

In  conclusion  we  have  discussed  some  issues  concerning  WDM  network  performance  with  a  dynamic  offered 
traffic  by  showing  the  results  obtained  by  discrete-event  simulation.  Our  work  is  essentially  an  analysis  aimed 
at  finding  out  which  can  be  the  most  important  parameters  that  must  be  taken  into  account  in  designing 
WDM  networks  in  a  dynamic  environment.  We  believe  that,  thank  to  the  optical  technology  improvement,  the 
dynamic-traffic  scenario  will  become  more  and  more  important  in  the  future. 
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Summary 

The  actual  program  of  EURESCOM  -  a  common  research  and  Development  Organisation  of 
European  Operators  -  includes  the  P918  project  “Integration  of  IP  over  Optical  Networks: 
networking  and  management”.  One  of  the  main  goals  of  the  P918  project  is  to  define  the 
functional  architecture  for  a  client-independent  Optical  Transport  Network  (OTN)  carrying 
different  clients  including  also  an  optimised  IP-based  network.  In  particular,  aspects  of 
management  of  an  OTN  are  investigated  to  reach  a  recommended  view  for  Network 
Operators. 

In  general  P918  is  applying  a  transport  network  modelling  recommended  by  ITU  (Rec.805), 
which  means  that  network  scenarios,  functional  architecture  and  management  requirements 
will  be  the  basis  for  the  specification  of  atomic  functions  and  management  information 
models. 


The  presentation  will  highlight  some  relevant  results  coming  from  of  P918,  including  a 
discussion  on  the  way  to  define  the  OTN  functional  requirements,  in  line  with  the  reality  of 
implementation  of  optical  networks.  An  issue  that  has  to  be  addressed,  for  example,  is  the 
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limited  optical  transmission  length  of  un-regenerated  (3R)  optical  channels.  In  order  to 
account  for  this  effect,  the  concept  of  the  transparency  domain  has  been  introduced. 

Based  on  this  overview  and  based  on  distinction  between  protocol  transparency  and 
transmission  transparency  basic  OTN  scenarios  will  be  discussed,  which  consider  transparent 
transmission  domains,  operator  domains  and  vendor  domains.  (Fig.l) 

The  concept  of  the  different  domains  allows  to  match  the  requirements  of  standardised 
management  of  optical  networks  with  the  fairly  early  idea  of  optical  transmission 
transparency. 

The  basic  management  requirements  of  managing  the  different  domains  and  the  end-to-end 
optical  channel  will  be  introduced  and  possible  implementations  of  related  OAM  functions 
will  be  compared,  while  the  realisation  of  the  end-to-end  view  is  based  on  a  digital  frame 
format.  Two  solutions  of  integration  of  the  digital  frame  and  of  the  3R  regeneration  into  the 
functional  architecture  based  on  the  recommendation  ITU-T  G.872  will  be  discussed. 

Finally,  requirements  on  protection  and  restoration  will  be  discussed  assuming  the  viewpoint 
that  an  integration  between  layer  networks  is  required  to  optimise  network  survivability  costs. 
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1.  Introduction:  Simulation  of  network  impairments  in  wavelength-routed  networks  can  be  extremely 
time-consuming  due  to  the  intensive  computation  required  to  calculate  wavelength  assignment.  In  a  real 
network  there  may  be  many  possible  solutions  for  the  wavelength  assignment  for  the  same  input 
parameters:  in  practice  the  actual  assignment  will  depend  on  the  previous  state  of  the  network.  For  a  fair 
assessment  of  network  impairments,  and  hence  a  reliable  calculation  of  component  requirements,  the 
variation  in  the  performance  should  be  calculated,  as  well  as  the  expected  performance.  In  order  to 
address  these  three  problems  we  have  developed  a  semi-analytic  method  using  probability  theory  to 
simulate  the  flow  of  signals  through  the  network,  and  to  calculate  confidence  limits  for  the  bit  error  rate 
(BER).  Previous  results  have  been  reported  in  [1].  In  this  paper  we  describe  the  basics  of  our  network 
model,  and  use  it  to  calculate  confidence  limits  for  the  BER  due  to  the  accumulation  of  homodyne  beat 
noise. 

2.  Network  Probability  Model:  Given  traffic  data,  a  routing  algorithm  is  used  to  calculate  the  required 
number  of  wavelength  channels  per  link.  Our  analysis  then  assumes  the  wavelength  assignment  to  be 
randomly  distributed  across  the  fibres  in  each  link.  Assuming  no  wavelength  conversion  to  overcome 
wavelength  blocking  requires  a  link  utilization  (Lu)  of  less  than  100%:  hence  a  link  utilization  parameter 
is  included  as  a  measure  of  the  ‘efficiency’  of  the  wavelength  assignment  algorithm.  These  assumptions 
allow  us  to  derive  analytical  expressions  for  the  PDFs  (probability  distribution  functions)  of  the  number 
of  crosstalk  interferers  collected  in  the  space-switch,  demultiplexer  and  multiplexer  stages  of  a 
wavelength-routing  node.  Homodyne  crosstalk  can  be  categorized  according  to  whether  the  crosstalk 
terms  are  time-delayed  replicas  of  a  received  signal  (same  laser),  or  from  different  lasers  at  the  same 
nominal  wavelength  (different  laser)  (2,3],  Hence  in  our  model  we  calculate  a  joint  PDF  for  the 
accumulation  of  each  class  of  interferer  onto  each  output  link  in  the  network.  Convolution  of  the  joint 
PDFs  of  the  appropriate  links  traversed  by  a  signal  determines  the  PDF  of  the  total  crosstalk  accumulation 
over  an  end-end  path.  This  methodology  takes  into  account  all  possible  wavelength  assignments  for  a 
fixed  set  of  input  parameters,  and  runs  very  quickly  on  a  PC.  We  have  used  the  model  to  investigate  the 
effects  on  the  crosstalk  accumulation  PDF  (over  a  worst-case  path)  as  a  function  of  link  utilization,  no.  of 
wavelength  channels  per  fibre,  nodal  connectivity  and  choice  of  routing  algorithm  [1]. 

3.  Calculation  of  Confidence  Limits  for  Acceptable  Network  Performance:  Recently  [4]  we  presented 
results  on  homodyne  beat  noise  effects  showing  that  the  effect  of  laser  wavelength  referencing  error  and 
drift  is  to  reduce  the  total  beat  noise  power  from  a  fraction  2,N.10B|°  of  the  incident  power  to  SF.N.10e/1°, 
where  SF  is  a  scaling  factor,  varying  between  0  and  2,  e  is  the  component  crosstalk,  and  N  is  the  number 
of  interfering  crosstalk  terms,  assumed  to  originate  from  different  laser  sources.  For  a  typical  laser 
misalignment  of  0.1  nm,  Hie  scaling  factor  has  values  of  0.1  and  0.4  at  bit-rates  of  2.5  Gbs'1  and  10  Gbs'1, 
respectively.  In  the  presence  of  a  further  M  crosstalk  interferers  originating  from  the  same  laser  as  the 
signal,  the  total  beat  noise  power  becomes  a  fraction  SF.N.10E/1°  +  2M.10B|°.  In  our  simulations  we  have 
found  that  different  laser  crosstalk  from  the  space  switch  is  dominant.  Hence,  for  reasons  of  brevity,  in 
this  paper  we  present  a  simplified  model  for  the  confidence  limits,  assuming  component  crosstalk  levels 
to  be  equal  for  the  combined  demux-mux  characteristic  and  the  space  switch,  and  treat  all  interferers  to  be 
from  different  lasers. 
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At  high  power  penalties  the  bit  error  rate  (BER)  can  be  approximated  by  considering  only  the  tail  under 
the  PDF  of  a  received  one  bit  at  the  receiver  system.  For  the  purposes  of  this  simplified  calculation  the 
joint  PDF  for  different  and  same  laser  crosstalk  accumulation  is  converted  to  a  1-D  PDF,  denoted  Pn(N). 
The  BER  then  depends  on  N,  the  total  number  of  crosstalk  interferers  accumulated  from  the  network 
model,  and  X,  the  number  of  channels  out  of  this  N,  in  which  a  ‘1*  bit  is  arriving  at  the  receiver  at  the 

decision  time.  where  Q  is  the  Q -function  integral,  P  is  the 

adjustment  in  the  power  signal  required  to 
(1)  overcome  crosstalk  interference,  and  a2  is  the 
receiver  noise  in  the  absence  of  crosstalk.  Our 
network  model  produces  a  discrete  PDF  for  the 


ber~-q\ 


(1+P)(l  +  (2X- 


0.5N)10^°) 


2^2<t2  +  (1  +  p)22X(SF/2)lO^° 


statistics  of  N.  The  statistical  properties  of  the  BER  depend  upon  the  statistical  properties  of  the  terms 
2X -0.57V  and  2X ,  which  we  call  m  and  v  respectively:  m  and  v  denote  the  relative  change  in  mean 
and  variance  in  the  Q-function  argument,  for  a  1  bit  received.  Calculating  the  joint  PDF  P(m,v )  for 
these  two  random  variables,  determines  the  likelihood  of  m  and  v  attaining  a  given  value: 


P(m,v ) 


\2(v-m) 


(2(v -/»))! 
(3v/2-2m)!.(v/2)! 


P„(2(v-m)) 


(2)  where  PN  is  the  1-D  crosstalk 
accumulation  PDF. 


Figure  1 :  Probability  of  acceptable  system 
performance  vs  power  penalty 


For  given  values  of  m  and  v,  the  power  penalty  may  be 
calculated  from  eqn.  (1).  In  fact,  for  a  given  power 
penalty,  P  =  101og(l  -F  P) ,  eqn.  (1)  defines  a  contour 
in  the  (m,v)  plane.  For  all  values  of  m  and  v  on  the 
appropriate  side  of  this  contour,  the  BER  will  be  less 
than  or  equal  to  the  required  value.  Hence,  by  summing 
the  PDF  of  m  and  v  for  all  points  on  the  appropriate 
side  of  the  contour,  we  may  calculate  the  probability 
that  the  system  BER  is  maintained  or  bettered.  This 
calculation  takes  into  account  all  possible  network 
configurations.  Figure  1,  shows  the  results  of  such  a 


calculation  assuming  a  target  BER  of  10'12,  a  noise  power  scaling  factor  of  0.4  (corresponding  to  a 
lOGbs'1  data  rate  and  an  RMS  laser  wavelength  misalignment  of  O.lnm)  and  as  a  function  of  the 
component  crosstalk.  The  results  indicate  that  for  component  crosstalk  of  ~40dB,  the  probability  that  the 
system  will  perform  acceptably  exceeds  99  %  for  a  power  penalty  of  only  0.7  dB.  The  results  also 
indicate  that  a  small  change  of  0.2dB  in  the  power  penalty  is  sufficient  to  bring  the  confidence  limit  down 
to  50  %.  The  results  also  show  the  confidence  limits  to  be  very  sensitive  to  the  crosstalk  level. 


4.  Conclusion:  We  have  described  novel  probabilistic  methods  to  analyze  the  accumulation  of 
impairments  in  wavelength-routed  networks.  We  have  used  the  model  to  calculate  confidence  limits  for 
acceptable  bit  error  rate  performance  in  the  presence  of  crosstalk  from  routing  elements  such  as  optical 
switches.  Our  calculation  takes  into  account  all  possible  network  configurations  and  laser  wavelength 
misalignment  due  to  drift  and  referencing  error. 
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I.  Network  simulation  case  study 

This  paper  investigates  optical  transmission  impairments  in  a  metropolitan  WDM  ring  network  using 
simulation.  We  find  that  these  networks  can  lead  to  paths  long  enough  where  non-linear  effects  can 
become  noticeable  and  should  be  properly  engineered  to  balance  the  effects  of  both  receiver  electrical 
noise  as  well  as  fiber  nonlinearities.  This  becomes  increasingly  important  for  the  design  of  WDM 
networks  in  the  emerging  metro  market. 

Fig.  1  presents  the  architecture  of  our  metro  network  simulation  case  study  which  consists  of  optical 
network  elements  arranged  into  two  interconnected,  self-healing  rings.  Each  ring  consists  of  six  bi¬ 
directional  parallel  Wavelength  Add/Drop  Multiplexers  (WADMs)  [1]  connected  with  duplex  standard 
fiber  links.  Two  2x2  Wavelength  Selective  Cross-connects  (WSXCs)  are  used  to  provide  dynamic 
wavelength  routing  between  the  two  rings.  All  fiber  links  are  assumed  equal  in  length  (20  km)  with  a 
0.4  dB/km  total  attenuation  (includes  installation  splices,  fiber  attenuation  and  connectors).. 


Fig.  l:  Metro  WDM  interconnected  ring  simulation  case  study  with  examined  worst-case  optical  path. 


Thirty-two  channels  spaced  at  100  GHz  (1536.6-1561.4  nm)  are  used  in  the  network.  All  amplifiers 
within  the  WADMs  and  WSXCs  are  identical  with  an  effective  noise  figure  of  7.8  dB  that  takes  into 
account  WADM  interstage  losses.  The  gain  ripple  for  each  amplifier  has  a  sinusoidal  shape  with  a  1  dB 
peak-to-peak  value.  The  above  network  is  decomposed  into  worst-case  paths  and  the  one  that 
represents  the  absolute  worst-case  for  most  of  the  network  impairments  (non-linear  effects,  ASE  noise 
and  crosstalk)  is  chosen  in  Fig.  1  (path  6-7-14-7-8-9-10-11-12-13-14-1-2-3-4-5).  This  long  path  was 
intentionally  routed  through  both  WSXCs  for  crosstalk  enhancement.  Simulation  results  show  that 
optimum  system  performance  is  obtained  for  an  average  input  power  per  span  of  1  dBm/ channel.  For 
channel  powers  less  than  that,  the  performance  is  affected  by  electrical  noise  at  the  receiver,  whereas  for 
launched  powers  above  ldBm  non-linear  effects  (mainly  XPM)  become  important.  All  simulation 
parameters  are  shown  in  Table  I. 


II.  Results-Discussion 

Figure  2  presents  results  based  on  the  Q-factor  for  the  above  worst-case  path  obtained  using  our  link 
simulation  tool  and  assuming  externally  modulated  transmitters  with  zero  chirp.  In  Fig.  2  (a)  we 
observe  that  around  the  strongest  channels  (#  4-14)  the  shape  of  the  curve  is  flat  because  the  channel 
performance  is  affected  mostly  by  XPM.  The  weakest  channel  (Le.  channel  #25)  is  affected  by  receiver 
electrical  noise  since  its  power  has  been  reduced  due  to  the  accumulated  amplifier  ripple.  All  thirty-two 
channels  have  performance  better  than  the  target  BER  of  1(H2  (Q  >=8.47  dB)  however  their  performance 
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varies  by  about  3.5  dB.  Included  are  Q- penalties  due  to  crosstalk  contributions,  waveform  distortion 
due  to  filter  concatenation  and  other  transmitter/receiver  imperfections  (ldBQ  for  each  of  these 
impairments)  resulting  in  a  system  margin  of  0.5  dB  for  the  weakest  channel,  barely  above  the  required 
8.47  dB  (Fig.  2  (a).). 

Table  1:  Network  annotation  parameters 


Bit  rate  (Gbps) 

2J 

Signal  wsvefeafhti  (am) 
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Ouumd  spacing  (GHz) 
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Fig.  2:  Q-factor  (dB)  for  worst-case  path  of  Kg.  1;  (a)  assuming  1  dB  sinusoidal  gain  ripple  per  amplifier;  (b)  same  as  (a)  but 
increasing  the  amplifier  gain  by  0.5  dB ;  (c)  same  as  (a)  but  reducing  the  ripple  of  each  component  to  0 .5  dB.  Solid  squares:  sinusoidal 
shape  amplifier  gain  ripple;  Open  squares;  flat  amplifier  gain. 

To  improve  the  performance  of  the  network  we  can  increase  the  gain  of  each  amplifier  so  that  the 
weakest  channels  are  not  affected  by  electrical  noise  at  the  receiver,  or  improve  the  ripple  characteristics 
of  the  amplifiers.  The  former  approach  is  effective  as  shown  in  Fig.  2(b)  where  we  get  a  2  dB 
improvement,  but  presents  the  challenge  that  increased  amplifier  gain  will  introduce  more  penalties 
from  non-linear  effects  (mostly  XPM)  on  the  strong  channels,  and  the  optimum  point  will  be  difficult  to 
find.  Reducing  the  ripple  characteristics  of  the  amplifiers  is  also  an  effective  approach  for  improving 
network  performance.  Fig.  2  (c)  shows  the  more  drastic  effect  of  reducing  the  amplifier  gain  ripple  to 
0.5  dB  resulting  in  performance  improvement  of  1.8  dB. 

In  the  above  case  study,  crosstalk  level  requirements  can  also  be  set  for  each  elementary  component. 
This  study  focused  on  common-channel  crosstalk  which  is  generated  at  the  WSXCs.  The  path  of  Fig.  1 
will  accumulate  four  first-order  crosstalk  terms  corresponding  to  the  number  of  times  the  signal  goes 
through  a  WSXC  as  well  as  a  number  of  higher-order  terms  which  for  this  study  are  neglected.  Using  a 
model  similar  to  the  one  presented  in  [2],  details  of  which  will  be  presented  at  the  conference,  the  Q- 
penalty  due  to  crosstalk  can  be  evaluated  for  different  individual  crosstalk  levels  e  of  the  WSXCs. 

The  above  work  provides  useful  design  information  regarding  trade-offs  among  different 
transmission  impairments  in  the  network.  It  was  shown  that  for  worst-case  paths  in  a  metro  optical 
network,  amplifier  ripple  can  introduce  significant  power  divergence  among  channels.  In  such  case, 
non-linear  fiber  transmission  impairments  may  become  significant  for  the  strongest  channels  whereas 
electrical  noise  at  the  receiver  will  affect  the  weakest  channels.  Balancing  the  above  effects  by  properly 
engineering  these  paths  in  the  network  will  be  very  important  for  obtaining  the  best  possible  network 
performance. 
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Abstract 

The  computer  industry  has  been  proudly  riding  on  the  exponential  curve  based  on 
Moore’s  law  for  decades.  What  Moore  did  not  expect,  and  perhaps  nobody  has  ever 
dreamt  of,  is  that  the  photonic  technology,  especially  its  application  in  cable  industry, 
has  similar  if  not  more  profound  impacts  on  the  communication  infrastructure  and 
service  delivery  mechanism. 

This  presentation  will  discuss  the  evolution  of  cable  infrastructure  from  the  point 
of  view  of  a  Photonic  Moor ’s  Law  that  is  based  on  the  continuous  technology  innovation 
in  lightwave,  RF,  and  digital  processing.  We  will  describe  the  new  exponential  curve  that 
represents  this  evolution,  starting  from  conventional  Fiber-to-the-Serving-Area  with  fiber 
node  segmentation,  through  DWDM-based  Secondary  Ring  architecture,  to  the 
LightWire  network  with  fully-passive  coax  plant,  and  to  the  CoraLight  architecture 
based  on  digital  and  distributed  processing  platform  with  DWDM  deployment  to  the  last 
thousands  feet. 
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Broadband  companies  have  started  deploying  DWDM  networks  for  the  transport  of  narrowcast  (NC)  services  such 
as  telephony,  high-speed-data  and  video-on-demand.  In  this  paper  we  compare  the  performance  of  four  different 
broadband  DWDM  architectures.  All  four  architectures  describe  a  broadband  DWDM  overlay  system  between  the  HE 
and  the  hub  for  the  transport  of  NC  services.  Separate  fibers  are  required  for  the  DWDM  overlay  and  for  the  broadcast 
(BC)  transport  since  otherwise  the  SRS-induced  crosstalk  would  destroy  the  signal  quality  of  the  predominantly  analog 
BC  video  channels.  At  the  hub,  each  of  the  demultiplexed  wavelengths  carrying  the  NC  signal  is  routed  to  a  specific 
node.  The  differences  in  the  architectures  lie  in  the  way  the  BC  and  NC  signals  are  combined  prior  to  distribution  over 
the  coax  plant.  The  four  architectures  are  shown  in  Figures  1  to  4. 

Figure  1  shows  a  simple  single-node-receiver  (SNR)  design.  In  order  to  meet  a  signal  quality  of  at  least  50  dB 
carrier-to-noise  ratio  (CNR)  at  the  node  receiver,  the  BC  path  (50  -  550  MHz)  employs  an  externally-modulated  1550  nm 
laser  from  the  HE  to  the  node.  The  NC  lasers  at  the  HE  are  directly-modulated,  1550  nm  ITU-compliant  lasers  carrying 
NC  services  above  550  MHz.  The  demultiplexed  NC  signal  is  optically  combined  (using  a  simple  1x2  coupler)  with  the 
BC  signal  at  the  hub  and  transported  to  the  optical  node  over  a  single  fiber.  A  single  receiver  at  the  node  detects  both  the 
BC  and  NC  optical  signals.  The  addition  of  the  NC  optical  signal  to  the  BC  signal  at  the  node  receiver  will  degrade  the 
CNR  performance  of  the  BC  path  (and  vice  versa).  Combining  both  the  BC  and  NC  optical  paths  together  at  a  single 
receiver  results  in  excess  shot  noise  and  relative  intensity  noise  (RIN).  The  degradation  of  the  BC  signals  increases  as  the 
average  optical  power  of  the  NC  signal  is  increased.  Even  more  severe  consequences  are  composite  second-order  (CSO) 
and  composite  triple  beat  (CTB)  distortion  products  generated  in  the  NC  path  that  result  from  laser  chirp  of  directly 
modulated  ITU  lasers  interacting  with  high  fiber  dispersion  at  1550  nm.  If  the  bandwidth  of  the  NC  signals  exceeds  50 
MHz  then  the  CSO  distortion  products  of  the  NC  signal  will  degrade  the  lower-frequency  analog  channels  in  the  BC 
band. 

Figure  2  shows  a  dual-node-receiver  (DNR)  design  in  which  the  BC  and  NC  optical  signals  travel  over  separate 
fibers  and  are  detected  by  separate  node  receivers.  They  are  then  combined  together  in  a  forward  combining  filter  that 
acts  as  a  low-pass  filter  for  the  BC  signals  and  a  high-pass  filter  for  the  NC  signals.  Thus,  noise  and  distortion  products 
present  at  the  output  of  the  NC  receiver  are  filtered  out  and  do  not  affect  the  performance  of  the  BC  signals,  and  vice- 
versa.  In  addition,  the  performance  of  each  path  can  be  individually  optimized  without  affecting  the  other.  This  is  the 
optimum  architecture  but  requires  a  second  receiver  and  fiber  as  well  as  a  forward  combining  filter  at  each  node. 

Figure  3  shows  a  dual-hub-receiver  (DHR)  design,  a  variation  of  the  DNR  design  in  which  the  BC  and  NC 
receivers  and  the  forward  combining  filter  are  all  located  at  the  hub.  An  overlay  13 10  nm  DFB  laser  is  then  used  to 
transport  combined  BC/NC  signals  from  the  hub  to  the  node. 

Figure  4  shows  a  hybrid-distributed-receiver  (HDR)  design,  an  interesting  variation  that  MediaOne  has  deployed 
in  one  of  its  systems.  In  this  design,  the  NC  signal  is  detected  at  the  hub,  high-  pass  filtered  (to  filter  out  noise  and 
distortion  products  that  would  otherwise  fall  on  top  of  the  BC  signal)  and  transported  to  the  node  over  an  overlay  1310 
nm  link.  A  coarse  WDM  coupler  is  used  to  combine  the  1550  nm  BC  and  1310  nm  NC  signals  onto  the  same  fiber.  At 
the  node,  the  combined  signals  are  detected  using  a  single  receiver.  This  architecture  is  therefore  a  hybrid  of  the  single 
and  dual-receiver  designs. 

The  overall  performance  of  each  of  the  above  DWDM  designs  were  modeled.  The  modeling  took  into  account  the 
major  impairments  in  the  system  including  shot  noise,  thermal  noise,  laser  intensity  noise,  chirp-induced  CSO  and  CTB 
and  EDFA  noise.  The  modeling  assumed  a  HE-to-hub  distance  of  100  km  and  a  hub-to-node  distance  of  10  km.  This 
closely  approximates  the  twenty-optical-channel  DNR-DWDM  experimental  system  in  MediaOne  Labs.  The  BC 
bandwidth  is  assumed  to  be  from  50  -  550  MHz  (78  analog  video  channels)  and  the  NC  band  is  above  550  MHz.  We 
have  also  assumed  that  the  RF  power  of  each  NC  channel  is  -5  dBc  (referenced  to  broadcast  video  carrier  level)  and  that 
the  optical  modulation  index  of  the  channels  have  been  optimized  so  as  to  avoid  laser  clipping.  The  purpose  of  the 
modeling  was  to  predict  the  maximum  NC  bandwidth  capability  of  each  DWDM  design. 

As  expected,  the  DNR  design  provided  the  best  performance,  supporting  a  full  310  MHz  complement  of  NC  256- 
QAM  channels  from  550  MHz  to  860  MHz.  The  DHR  design  also  provides  for  a  3 1 0  MHz  NC  payload  except  that  the 
effective  SNR  values  are  reduced  because  of  the  presence  of  the  13 10  nm  link.  In  a  SNR  design,  the  NC  bandwidth  is 
limited  to  50  MHz  due  to  the  effects  of  chirp-induced  distortion  products  previously  mentioned. 

The  performance  of  the  HDR  design  lies  between  that  of  the  SNR  and  DNR  designs.  The  presence  of  the  high- 
pass  filter  after  the  NC  receiver  in  the  hub  means  that  chirp-induced  CSO  in  the  NC  path  is  filtered  out  prior  to  being 
combined  with  the  BC  signal.  Both  theoretical  results  and  experimental  evidence  indicates  that  this  hybrid  architecture 
gives  adequate  performance  (defined  as  a  BC  CNR  of  50  dB  and  a  NC  SNR  of  38  dB)  as  the  number  of  NC  256-QAM 
channels  are  increased  to  25  (150  MHz).  This  is  more  than  sufficient  narrowcast  bandwidth  for  the  next  several  years. 
Considering  that  a  NC  bandwidth  of  150  MHz  is  equivalent  to  a  data  rate  of  975  Mb/s  (using  256-QAM)  this  is 
equivalent  to  providing  each  home  (in  a  250-home  node  size)  with  a  dedicated  bandwidth  of  3.9  Mb/s.  Assuming  a  more 
0-7803-6252-7/00/$  1 0.0002000  IEEE  23 


realistic  take  rate  of  50  %  and  a  usage  factor  of  ten,  the  typical  data  rate  provided  to  each  active  subscriber  is  greater  than 
this  by  a  factor  of  20,  or  78  Mb/s.  This  is  well  above  what  can  be  provided  by  competing  DSL  or  even  VDSL  providers. 

In  conclusion,  experimental  results  illustrate  the  markedly  superior  technical  performance  of  the  dual-receiver 
DWDM  designs  in  accordance  with  our  modeling  predictions.  A  simple  single-receiver  implementation  is  adequate  for 
rapid  initial  deployment  of  a  DWDM  system  with  NC  bandwidth  requirements  of  less  than  50  MHz.  As  narrowcast 
bandwidth  requirements  exceed  50  MHz,  a  narrowcast  receiver  and  high-pass  filter  can  be  added  in  the  hub  to  convert  the 
system  into  a  HDR  design.  This  design  can  support  a  narrowcast  bandwidth  up  to  1 50  MHz  (twenty  five  256-QAM 
channels)  which  provides  more  than  sufficient  bandwidth  for  the  foreseeable  future.  If  narrowcast  bandwidth  exceeding 
150  MHz  is  required  someday,  then  the  system  can  be  upgraded  to  one  of  the  two  flavors  of  dual-receiver  designs.  In  the 
upstream  direction,  either  a  10-bit  digital  reverse  or  a  4-channel  block  conversion  scheme  in  conjunction  with  DWDM 
results  in  the  most  cost-effective  return  path  solution.  Experimental  results  on  both  return-path  techniques  are  presented. 


PDN 

Ntxfc 


Headend  Hub/OTN/CEC 


Figure  4  -  Hybrid-distributed-receiver  (HDR)design. 
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Evaluation  of  SuperPON  demonstrator 
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Abstract —  The  paper  presents  evaluation  results  of 
the  SuperPON  demonstrator.  It  shows  the  feasibility 
of  a  Passive  Optical  Network  with  increased  bitrate 
(2.5  Gbit/s  downstream  and  311  Mbit/s  upstream), 
extended  range  (100  km)  and  enlarged  splitting  factor 
(2048). 

I.  Introduction 

The  end-to-end  performance  of  the  SuperPON  (Su¬ 
per  Passive  Optical  Network)  demonstrator  has  been 
evaluated.  The  SuperPON  system  is  an  optical  ATM 
access  network,  which  can  support  a  large  number 
(2048)  of  ON  Us  (Optical  Network  Units)  and  cover  a 
long  distance  (100  km) .  To  overcome  the  large  losses, 
optical  amplifiers  are  introduced.  EDFAs  (Erbium- 
Doped  Fiber  Amplifier)  are  used  for  2.5  Gbit /s  down¬ 
stream  TDM  (Time  Division  Multiplexing)  transmis¬ 
sion  at  1550  nm  towards  the  subscribers  and  SOAs 
(Semiconductor  Optical  Amplifiers)  for  311  Mbit/s 
upstream  TDMA  (Time  Division  Multiple  Access) 
transmission  at  1310  nm  towards  the  Access  Node. 
While  a  first  lab  model  was  restricted  to  the  evalu¬ 
ation  of  the  optical  transmission  part  of  SuperPON 
[1].  the  present  demonstrator  [2]  was  realised  with  a 
dedicated  chipset  for  the  SuperPON  transport  sys¬ 
tem  [3].  The  performance  of  the  optical  layer  was 
further  improved  by  a  newly  designed  BMRx  (Durst 
Mode  Receiver)  [4]  and  the  use  of  reflective  SOAs  [5]. 

An  earlier  paper  already  described  standalone  tests 
on  the  uplink  consisting  of  the  BMRx  and  a  cascade 
of  SOAs  [4],  The  present  paper  reports  on  the  in¬ 
tegration  of  the  transport  system  enabling  ASICs  in 
the  system  and  the  end-to-end  performance  in  down¬ 
stream  and  upstream  direction  between  the  service 
interfaces  at  the  Line  Termination  (LT)  and  ONU. 

II.  SuperPON  setup 

The  evaluation  setup  is  shown  in  figure  1.  Two 
ON  Us  sit  uated  in  two  different  branches  are  used  to 
evaluate  the  performance  of  the  system. 

The  downstream  communication  consists  of  a  con¬ 
tinuous  bitstream  with  constant  power  level  that  is 
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Fig.  1.  SuperPON  setup  for  the  evaluation. 


broadcast  to  all  ONUs.  After  the  feeder  section,  the 
signal  arrives  at  the  Amplified  Splitter,  where  a  first 
splitting  is  achieved,  together  with  optical  amplifi¬ 
cation.  The  signal  then  passes  via  the  drop  section, 
where  a  second  splitting  stage  is  situated,  and  finally 
arrives  at  the  ONU. 

The  upstream  communication  consists  of  a  sequence 
of  data  cells  coming  from  different  ONUs  and.  hence, 
exhibits  bursty  variations  of  the  power  level.  The  two 
first  stages  of  SOAs  (SOA1  and  SOA2)  are  switched 
on/off  on  a  cell  by  cell  basis  in  order  to  reduce  noise 
accumulation  from  amplifiers  in  parallel  branches, 
while  the  SOA  common  for  the  different  branches 
(SOA3)  operates  in  continuous-wave  mode.  A  SOA 
has  a  fast  response  so  that  the  signal  is  not  corrupted 
by  slow  settling  of  the  gain  due  to  abrupt  changes  of 
the  input  power.  The  use  of  reflective  SOAs  [5]  in 
the  first  two  stages  decreases  the  polarization  depen¬ 
dency  of  the  gain,  wliicli  is  only  2.5  dB  for  4  SOAs 
hi  cascade. 

For  the  reception  of  the  upstream,  an  improved 
BMRx  with  an  offset  compensation,  a  gain  control 
on  a  burst  by  burst  basis  and  a  duty  cycle  compen¬ 
sation  has  been  designed  [4]. 

A  dedicated  chipset  has  been  developed  that  provides 
the  necessary  operations  for  the  SuperPON  transport 
system.  The  functionality  is  very  similar  to  a  stan¬ 
dardised  APON  system  [6],  but  in  the  present  case 
supports  the  long  round  trip  delay  (corresponding  to 
100  km),  the  identification  of  2048  ONUs,  and  in¬ 
creased  bitrates  (2448/311  Mbit/s). 

The  SuperPON  Transport  System  Protocol  initialises 
the  system  and  provides  the  operation  and  mainte¬ 
nance  during  normal  operation.  It  puts  the  different 
thresholds  and  gains  of  the  BMRx  and  the  SOAs,  de¬ 
pending  on  the  power  level  of  the  upstream  cells  sent 
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by  the  respective  OKU.  The  STSP  synchronises  each 
ONU  to  ensure  a  proper  operation  of  the  TDMA  and 
allocates  the  bandwidth  per  connection.  It  handles 
the  pulse  width  correction  at  the  LT  and  the  power 
leveling  of  the  ONUs. 

In  order  to  evaluate  the  system,  a  BER  tester  Hr 
E42108  was  connected  to  the  LT.  It  generates  an 
ATM  cell  stream  with  PRBS-23  data  pattern  for  each 
ONU.  This  traffic  is  sent  to  the  LT  via  STMI  inter¬ 
faces  of  155  Mbit/s  and  is  multiplexed  by  the  LT  into 
the  2.5  Gbit/s  downstream  of  SuperPON.  In  order  to 
evaluate  the  complete  path  (downstream-upstream), 
the  data  traffic  is  loopbac.ked  in  the  ONUs  and  is  re¬ 
ceived  al.  the  LT  side,  which  sends  it  bach  to  the  BER. 
tester  for  verification. 

In  order  to  measure  the  allowed  optical  budget  and 
dynamic  range  in  the  drop  section,  power  meters  and 
tunable  optical  attenuators  have  been  inserted  in  the 
various  sections  of  the  system. 

III.  Performance 

Figure  2  shows  the  BER  in  function  of  the  average 
input  power  at  the  BMRx  (in  dBm)  for  one  ONU. 
A  BER  smaller  than  10~9  is  obtained  for  an  average 
input  power  at  the  BMRx  varying  from  -19  dBm 
to  -32  dBm  (in  order  to  study  the  behaviour  of  the 
BMR,x,  all  the  SOAs  are  removed  from  the  setup). 


Fig.  2.  BER  for  one  ONU. 

Figure  3  shows  the  region  with  a  BER  smaller  than 
10-9  for  the  average  input  power  at  the  BMRx  com¬ 
ing  from  the  two  ONUs.  The  graph  is  symmetric 
with  regard  to  the  diagonals,  showing  that  the  two 
branches  are  equivalent.  The  region  looks  like  a  trun¬ 
cated  square,  in  which  the  truncated  regions  are  sit¬ 
uated  at  extreme  differences  of  average  input  power 
from  the  two  ONUs. 

IV.  Discussion  and  conclusion 

This  paper  reports  results  of  the  end-to-end  per¬ 
formance  of  a  SuperPON  demonstrator  with  trans- 


Average  input  pnwer  at  the  BMRX  from  NTH  (dBm) 


Fig.  3.  BER  for  two  ONUs. 


port  system  enabling  ASICs.  The  measured  dynamic 
range  of  13  dB  at  the  BMRx  is  sufficient  in  such  a 
system  where  optical  losses  and  amplifier  gains  may 
become  very  large.  It  is  because  the  excess  of  varia¬ 
tions  in  the  optical  budget  is  compressed  on  one  hand 
by  the  gain  saturation  of  the  SOAs  and  on  the  other 
hand  by  the  control  of  the  gain  on  a  cell  by  cell  basis 
[4].  The  losses  in  the  drop  section  allow  to  support 
a  split  of  64  and  a  drop  length  of  10  km.  In  com¬ 
bination  with  a  split  of  32  in  the  amplified  splitter 
and  a  feeder  of  90  km,  a  total  of  2048  ONUs  can  be 
serviced  at  a  range  of  up  to  100  km.  The  prototype 
system  is  installed  in  a  technical  field  trial  and  inte¬ 
grated  with  an  optical  core  network  in  the  frame  of 
ACTS  PELICAN  during  the  first  half  of  this  year. 
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Spectral  slicing  of  spontaneous  emission  sources  has  been  proposed  to  deliver  broadcast  services  on  WDM 
passive  optical  networks  [1].  Recent  advances  in  ultra-high  power  LEDs  makes  spectral  slicing  a  more  promising 
solution  for  fiber  to  the  home  (FTTH)  broadband  access  systems.  However,  the  output  from  a  spontaneous  emission 
source  contains  spontaneous-spontaneous  emission  beat  noise  [2],  also  called  "excess  intensity  noise".  The  signal  to 
spontaneous-spontaneous  emission  beat  noise  ratio  is  proportional  to  the  ratio  of  the  optical  bandwidth  of  the  optical 
source  to  the  electrical  detection  bandwidth  [3],  With  the  new  requirements  of  delivering  high  speed  broadband  data 
in  access  networks  and  the  introduction  of  closer  and  narrower  bandwidth  optical  wavelength  channels,  the  effects 
of  spontaneous-spontaneous  emission  beat  noise  becomes  more  pronounced  in  spectrally  sliced  WDM  systems.  In 
this  paper,  we  experimentally  study  the  performance  of  broadband  data  delivery  on  WDM  systems  using  spectrally 
sliced  spontaneous  emission  sources. 

Fig.  1  shows  the  experimental  setup. 

An  ultra-high  power  LED  is  directly 
modulated  with  a  broadband  27-l  PRBS 
sequence.  The  output  from  the  LED  is 
transmitted  through  a  spool  of  5km  All- 
Wave™  fiber  and  spectrally  sliced  using  a 
commercial  thin  film  wavelength  add/drop 
multiplexer  with  0.55nm  3-dB  optical 
bandwidth  before  being  detected  with  a 
photodetector.  Both  PIN  and  APD  detectors 
are  used  in  the  experiment. 

We  measured  the  bit  error  rate 
(BER)  versus  received  optical  power  at 
622Mbps  (OC-12  speed)  and  1.25Gbps 
(Gigabit  Ethernet  speed).  The  results  are 
plotted  as  dots  in  Fig.  2  and  Fig.  3.  In  spite  of  the  fact  that  a  0.55nm  wide  optical  signal  was  detected,  there  was 
hardly  any  dispersion  penalty  due  to  the  5km  fiber  link. 

The  BER  is  determined  by  the  signal  to  noise  ratio.  For  the  PIN  receiver,  the  noise  variation  is  given  by: 

o‘„=^-  +  2qIB,+(l  +  p)^P  (1) 

The  three  terms  represent  thermal  noise,  shot  noise  and  spontaneous-spontaneous  emission  beat  noise.  In  (1)  R  is 
the  receiver  load  resistance,  /  the  photocurrent,  Be  and  B0  the  receiver  electrical  bandwidth  and  optical  bandwidth  of 
the  spectrum  slice.  The  factor  p  (0  <p  <  1)  in  (1)  represents  the  degree  of  polarization  of  the  received  optical  signal. 
For  a  completely  unpolarized  signal,  p  =  0  and  for  a  pure  polarized  signal,  p  =  1 .  The  optical  signal  at  the  receiver 
end  was  measured  to  be  70%  to  80%  polarized  in  our  experiment,  which  was  expected  from  an  edge  emitting  LED 
device.  The  dark  current  is  neglected. 

For  APD  receivers,  the  shot  noise  and  spontaneous-spontaneous  emission  beat  noise  are  enhanced  so  that 
a  1TB  B 

<„  =-T^+1NFAPD{M)qIB,+NFArD(\  +  p)-J-P  (2) 

where  M  is  the  avalanche  multiplication  factor  of  the  APD  detector  and  NFAPD=(M2)/(M )2  is  the  APD  noise  factor 
[4],  Therefore,  although  the  detected  photocurrent  has  been  amplified  by  a  factor  of  (A/),  the  optical  noise  is  also 
enhanced.  The  continuous  lines  in  Fig.  2  and  Fig.  3  are  the  computed  BER  assuming  Gaussian  statistics  for  the  total 
noise  at  622Mbps  and  1.25Gbps  respectively,  using  (1)  and  (2).  The  dashed  lines  in  Fig.  2  and  Fig.  3  are  the 
calculated  BER  using  the  same  parameters  but  with  the  spontaneous-spontaneous  emission  beat  noise  turned  off. 
This  corresponds  to  the  situation  when  a  laser  diode  having  low  RJN  noise  or  when  an  unsliced  spontaneous 
emission  source  with  very  broad  bandwidth  is  used. 


DC  bias 
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As  seen  from  the  dashed  lines  in  Fig.  2  and  Fig.  3,  in  the  absence  of  spontaneous-spontaneous  emission 
beat  noise,  APD  receivers  have  an  advantage  over  PIN  receivers,  due  to  avalanche  multiplication  gain.  When  the 
optical  noise  is  considered,  this  is  still  the  case  at  low  optical  power,  i.e.  when  /  is  small  in  (1)  and  (2).  As  received 
optical  power  increases,  both  the  PIN  and  APD  receiver  performance  relative  to  the  thermal  noise  only  case 
degrades,  with  the  APD  degrading  more  rapidly  due  to  the  multiplicative  terms  in  (2)  which  increases  the 
importance  of  the  shot  noise  and  spontaneous-spontaneous  emission  beat  noise.  Considerations  of  (1)  and  (2) 
indicates  a  floor  at  SNR=t]B0/Be(l+p),  where  rj=l  for  PIN  and  rj=NFAPD>\  for  APDs.  Thus,  we  can  see  from  Fig.  3 
that  the  APD  BER  quickly  reaches  an  error  floor.  At  622Mbps,  the  APD  receiver  achieved  a  1  O'9  BER  at  -34dBm 
sensitivity.  However,  for  1.25Gbps  data  rate,  the  APD  receiver  BER  flattened  at  about  -26dBm  received  power 
with  a  BER  slightly  larger  than  10‘9.  On  the  other  hand,  the  PIN  receiver  achieved  10'9  BER  at  both  622Mbps  and 
1.25Gbps  with  -24.5dBm  and-21.5dBm  received  optical  power,  before  reaching  the  error  floor  in  our  experiment. 


Fig.  2  BER  vs.  received  optical  power  for  Fig-  3  BER  vs.  received  optical  power  for 

PIN  and  APD  detectors  at  622Mbps  PIN  and  APD  detectors  at  1.25Gbps 

From  a  systems  viewpoint,  the  BER  crossover  point  between  APD  and  PIN  receivers  in  a  high  data  rate 
spectrally  sliced  DWDM  link  depends  on  the  gain  and  noise  figure  of  the  APD  receiver,  and  the  relative  ratio  of  the 
electrical  bandwidth  and  optical  channel  bandwidth.  When  the  optical  channels  are  narrow,  as  in  closely  spaced 
(e.g.  50GHz  or  25GHz  spacing)  systems,  it  is  possible  that  APDs  will  not  offer  any  advantage.  Fig.  2  and  Fig.  3 
also  showed  that  the  relative  performance  of  APD  and  PIN  receivers  depends  on  the  power  budget  and  affordable 
maximum  bit  error  rate.  If  forward  error  correction  (FEC)  is  used  to  enable  the  system  to  operate  with  a  small 
power  budget,  one  may  still  prefer  APD  receivers  and  let  FEC  to  take  care  of  the  error  floor. 

In  conclusion,  we  compared  the  performance  of  PIN  and  APD  detectors  for  broadband  data  delivery  using 
spectrally  sliced  spontaneous  emission  sources.  Under  spontaneous-spontaneous  emission  beat  noise  limitation, 
APD  receivers  may  lose  its  advantage  more  quickly  as  a  result  of  the  optical  signal  to  noise  degradation,  which 
arises  from  the  stochastic  avalanche  multiplication  effect. 
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Abstract :  We  present  a  discussion  of  the  impact  of  interferometric  noise  in  radio-over-fibre  delivery  networks, 
and  provide  rigorous  bit  error  rate  results  for  the  transmission  of  quadrature  amplitude  modulation  (QAM). 

Introduction  :  The  prospect  of  broadband  wireless  access  systems  provides  a  number  of  service  opportunities, 
for  instance,  wireless  local  area  networks  (WLANs)  or  fixed  wireless  access  systems.  Added  to  this,  the  cost 
and  reach  advantages  of  optical  delivery  of  the  radio  signals  it  is  fairly  obvious  that  this  technology  will 
have  a  huge  impact  on  the  access  network.  To  provide  the  required  high  bandwidth  and  frequency  reuse, 
it  is  envisaged  that  such  systems  will  operate  well  into  the  millimetre-wave  (mm-wave)  spectrum,  so  that 
even  optical  transport  of  these  signals  is  not  without  problems  due  to  the  high  frequencies  that  are  involved. 
To  over  come  the  main  restricting  factor,  dispersion,  single  sideband  (SSB)  techniques  have  been  adopted, 
which  effectively  eliminates  the  dispersion-induced  carrier  power  fluctuations,  allowing  operation  over  tens 
of  kilometres  [1] . 

A  number  of  methods  have  been  investigated  to  generate  the  optical  SSB  signal  carrier,  however  we  focus 
on  the  method  of  [2]  which  uses  carrier  suppression  to  create  the  two  carriers  required  for  self-heterodyne 
mm-wave  transmission.  We  consider  the  impact  of  interferometric  noise  (IN)  in  such  delivery  networks. 
IN  is  formed  when  the  desired  signal  is  received  with  other  lightwave  components  at  nominally  the  same 
wavelength.  The  components  may  emanate  from  distinct  laser  sources  or  may  be  replicas  of  the  desired  signal 
delayed  by  greater  than  the  coherence  time  of  the  source  laser.  These  crosstalk  signals  may  be  formed  due 
to  reflections  at  fibre  splices,  or  by  leakage  in  filtering  or  routing  elements.  Previous  studies  have  considered 
this  phenomenon  for  transmission  of  binary  OOK  data  on  radio-over-fibre  systems  [3].  However,  here  we 
extend  this  analysis  to  quadrature  amplitude  modulated  (QAM)  data,  which  is  a  more  usual  modulation 
scheme  for  access  links. 

Radio-over- Fiber  Transmission  :  The  method  of  [2]  creates  two  coherent  optical  spectral  lines  separated 


Figure  1:  Schematic  of  mm-wave  transmission.  DFB  =  DFB  Laser  Diode  Source,  MZM  =  Mach-Zehnder 
Modulator,  OF  =  Optical  Filter  PD  =  Photo-diode 


by  the  desired  millimetre  wave  frequency  using  a  single  laser  source  and  a  Mach-Zehnder  modulator  driven 
with  half  the  required  mm-wave  frequency.  Due  to  the  co-sinusiodal  nature  of  the  Mach-Zehnder  response, 
when  biased  at  the  point  of  minimum  transmission  (Vm as  =  K)  all  even  terms,  including  the  carrier  will 
be  suppressed,  with  first  sidebands  being  dominant.  It  is  then  possible  to  modulate  data  onto  one  of  the 
sidebands  for  transmission  to  the  base  station  as  shown  in  figure  1 
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Theoretical  Analysis  :  We  define  the  signal  space  of  a  M-ary  QAM  system  to  be: 


°°  /  t  _  l'I'\ 

cos(2ir fct)+  ^2  b(t)  rect  f — — — J  sin(27r/ct)  (1) 

l=  —  0O  '  ' 

where  fc  is  the  carrier  frequency,  T  the  signalling  interval,  and  rect(-)  is  the  rectangular  window  function.  We 
consider  that  the  signal  is  made  up  of  two  independent  and  identically  distributed  data  streams  a(t),b(t)  G 
{±7,  ±37,  •  •  •  ,  ±(2 N  -  1)7}  for  M-ary  QAM,  where  M  =  (2AT)2 

Therefore,  using  the  system  presented  in  figure  1,  for  uncorrupted  transmission  the  signal  at  the  photo-diode 
can  be  described  by: 

Et{t)  =  \fj?0  -2»/mm /2)t+*. (t)l  +  m(t)eil(^+27r/mm/2)t+0o(t)]j  (2) 

where  u0  and  <j>0{t)  are  the  optical  frequency  and  phase  respectively.  The  inclusion  of  interfering  terms 
will,  upon  detection,  create  noise  terms  that  will  affect  both  the  amplitude  and  phase  of  the  QAM  signal. 
The  noise  terms  have  been  rigorously  modelled  to  produce  a  moment  generating  function  (MGF)  for  the  i 
channel: 


*(«)  =  h  (sPrsfta{t))  +  Jo  (sPr^a]{t)  +  b}{t))  (3) 

where  e  is  the  relative  strength  of  the  interfering  terms  designated  with  the  subscript  i.  Using  the  modified 
Chernoff  bound  [4]  we  calculate  the  bit  error  rate  for  QAM  transmission  with  various  signal  amplitudes  7. 


Results  and  Discussion  :  In  figure  2  we  demon¬ 
strate  that  QAM  transmission  offers  an  im¬ 
provement  in  BER  over  a  standard  binary  chan¬ 
nel,  for  a  normalised  received  amplitude  signal, 
(decision  threshold  0.5).  This  is  predominately 
due  to  the  use  of  gray  coding  for  QAM  trans¬ 
mission,  which  places  certain  symbol  combina¬ 
tions  further  from  the  decision  threshold  than 
would  normally  be  the  case.  We  see  however 
that  QAM  transmission  is  subject  to  higher 
degradation  due  to  interferometric  noise  than 
would  be  the  case  in  standard  binary  ASK  trans¬ 
mission.  This  conclusion  is  logical,  as  interfer¬ 
ence  effects  not  only  the  amplitude,  but  also 
the  phase  of  the  received  signal,  which  is  a 
major  factor  in  the  performance  of  coherently 
demodulated  signals. 


Figure  2:  Bit  Error  Rate  evaluation  for  QAM 
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Introduction 

Over  the  past  several  years,  there  has  been  an  industry-wide  push  to  examine  the  combination  of  WDM  and 
optical  switching  technologies  to  provide  a  configurable  optical  layer  for  national  scale  and  metropolitan 
transport  applications.  Early  on,  the  vision  of  this  optical  layer  was  one  in  which  signals  were  transported 
transparently  through  a  national  scale  network  without  regeneration  along  the  way  [1,2].  After  preliminary 
experimental  work,  it  became  clear  that  the  transparent  reach  distances  over  which  WDM  signals  could  be 
transported  in  such  configurable  optical  networks  was  limited  by  physical  constraints  to  about  2000  km  [3], 
which  is  shorter  than  necessary  on  most  continents.  Recognition  of  this  led  to  two  alternate  proposals.  One 
became  known  as  the  “Opaque”  network  solution  because  it  used  regeneration  on  every  channel  at  every 
configurable  point  in  the  network  [4].  This  solves  the  problem  of  limited  reach,  but  introduces  many  more 
regenerators  into  the  network,  which  are  one  of  the  more  costly  components  of  the  equipment.  The  other 
approach  became  known  as  “Islands  of  Transparency”  because  smaller  transparent  sub-networks  were 
defined  that  were  connected  via  regeneration  sites  to  form  a  national  scale  network  [5],  This  approach 
solves  the  problem  of  expensive  regenerators,  and  allows  the  sub-network  size  to  be  tailored  to  the  reach 
length  that  can  be  achieved.  But  even  then,  a  reach  length  of  2000  km  is  not  enough  to  handle  the  WDM 
ring  sizes  that  would  be  needed  in  the  West  and  Southwest  regions  of  the  United  States. 

A  fresh  look  at  this  problem  indicates  another  potential  network  solution.  In  this  case,  each  wavelength 
path  in  the  network  is  routed  in  the  network  in  such  a  way  that  regenerators  are  placed  at  known  sites  and  at 
distances  that  are  consistent  with  the  reach  length  of  the  technology.  This  approach  minimizes  the  number 
of  regenerators,  but  requires  that  the  route  length  of  each  link  of  the  network  be  recorded  and 
communicated  to  adjoining  network  management  systems  each  time  a  new  wavelength  path  is  established. 

These  network  solutions  are  coming  in  response  to  rapidly  growing  demands  for  voice,  transaction  data, 
and  internet  traffic,  which  shape  the  new  WDM  network  topologies  and  approaches.  For  example,  the 
characteristics  of  internet  traffic  set  the  reach  length  needed  for  internet  backbone  applications,  and  the  cost 
savings  of  a  managed  reach  approach  are  linked  to  these  traffic  characteristics  and  to  the  technically 
achievable  reach  length.  The  United  States  long-distance  network  is  used  as  an  example  to  illustrate  these 
points.  It  indicates  that  for  internet  traffic,  a  managed  reach  network  with  achievable  reach  lengths  of  2500 
km  or  so  can  virtually  eliminate  redundant  regenerators  and  significantly  reduce  equipment  costs,  as 
compared  to  other  approaches. 

Traffic  Damand  Growth  and  tho  Value  of  Extended  Reach  Length 

A  comprehensive  traffic  demand  and  growth  study  has  been  performed  for  a  USA  network  consisting  of  46 
major  cities  connected  by  63  fiber  links  [6].  The  network  topology  is  similar  to  those  used  by  current  USA 
long-distance  carriers.  In  this  study,  revenue-bearing  traffic  demands  between  city  pairs  are  computed  for 
voice,  transaction  data,  and  internet  applications  separately,  and  the  growth  of  each  of  these  three 
components  of  the  traffic  estimated  from  available  trends.  These  traffic  demands  are  used  to  determine  the 
required  capacity,  taking  into  account  peak-to-average  ratios,  framing  overheads,  and  TDM  channel 
loading  inefficiencies.  Each  demand  pair  is  routed  in  the  network  using  the  shortest  distance  available,  and 
network  statistics  are  recorded  describing  the  number  of  channels  per  link,  the  number  and  location  of 
regenerator  sites,  the  number  of  channels  dropped  at  each  city,  and  the  total  number  of  channels  entering  a 
city.  These  are  used  to  understand  the  required  reach  length  and  the  network  economics.  The  aggregate 
network  traffic  demands  and  growth  rates  for  this  study  echo  the  well-known  point  that  internet  traffic,  if  it 
continues  to  grow  at  its  present  rate  of  about  150%  per  year,  will  dominate  the  network  in  a  few  years 

One  important  observation  from  the  traffic  studies  is  that  the  average  connection  distance  for  internet  traffic 
is  significantly  longer  than  that  for  voice  traffic.  A  span  distances  of  600  km,  as  is  ordinarily  used  today, 
satisfies  more  than  60%  of  the  connections  for  voice  traffic.  But  for  internet  traffic,  which  is  not  distance 
dependent  as  is  voice  traffic,  a  reach  length  of  about  3000  km  is  sufficient  to  satisfy  60%  of  the 
connections.  If  the  reach  length  is  shorter  than  this,  then  unnecessary  regeneration  is  needed  on  many 
internet  backbone  connections. 
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To  illustrate  this  point,  the  study  calculates  the  fraction  of  traffic  that  can  pass  transparently  through  each 
city,  as  a  function  of  the  reach  length  achievable  by  the  technology,  as  shown  in  Figure  1.  The  pass¬ 
through  traffic  does  not  need  regeneration,  so  this  fraction  represents  the  fraction  of  regenerators  that  can 
be  eliminated  from  the  network  at  each  reach  length.  It  is  clear  from  this  illustration  that  longer  reach  is 
needed  for  efficient  routing  of  internet  traffic  and  that  available  technologies  that  achieve  about  2500  km 
reach  length  can  help  to  eliminate  expensive  regenerators  from  the  network. 


To  quantify  this  concept,  a  cost  study  was  performed  for  the  USA  network  described  above,  considering 
four  different  scenarios.  The  first  scenario  assumes  that  the  configurable  WDM  layer  is  achieved  by 
“Opaque”  crossconnects  that  consist  of  transcievers  surrounding  an  electronic  switch  fabric.  The  other 
three  scenarios  assume  that  the  managed  reach  approach  is  employed,  where  configurability  is  achieved 
with  optical  switch  fabrics,  allowing  only  those  channels  needing  regeneration  to  be  handled  by 
transcievers  and  electronic  switch  fabrics.  These  four  scenarios,  illustrated  in  Figure  2,  show  that  a  modest 
investment  in  optical  fabrics  for  configurability,  coupled  with  technologies  that  achieve  a  reach  length  of 
2500  km  or  more,  can  result  in  a  significant  overall  reduction  in  network  infrastructure  costs. 


Transparent  reach  (Km) 


Figure  1:  Fraction  of  pass-through  traffic. 
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Figure  2.  Value  of  transparent  reach. 


Of  course  long  reach  technologies  require  introduction  of  a  combination  of  new  transmission  approaches, 
such  as  optimized  fibers,  Raman  amplification,  near-perfect  dispersion  compensation,  RZ  modulation 
formats,  and  Forward  Error  Correction  coding  techniques.  But  these  are  well  within  the  scope  of  today's 
capabilities  at  rates  up  to  10  Gb/s  [7],  Additional  challenges  include  providing  technological  solutions  for 
optical  performance  monitoring  and  optical  network  management  of  such  managed  reach  networks. 


Summary 

Traffic  growth  for  internet  applications  will  likely  dominate  long-distance  networks  in  the  foreseeable 
future,  which  presents  opportunities  for  new  optical  layer  approaches  that  are  tailored  to  internet  traffic 
characteristics.  One  important  difference  between  the  traffic  patterns  generated  by  internet  applications  is 
that  the  internet  connections  on  average  are  significantly  longer  than  the  conventional  voice  connections, 
which  allows  for  network  cost  savings  by  combining  long  reach  transport  with  optical  switching 
technologies  to  route  and  manage  those  connections.  Fortunately  there  is  an  industry-wide  effort  to  create 
the  required  technology  for  such  a  managed  reach  optical  layer.  The  value  of  the  optical  layer  will  be  more 
effectively  achieved  as  reach  lengths  increase  to  match  internet  traffic  needs. 
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1.  Introduction 

The  introduction  of  Internet,  firstly  driven  by  an  enterprise  traffic  increasing  dramatically  the  volume  of 
informations  transported,  and  followed  by  an  offensive  introduction  of  new  applications  at  the  subscriber  side, 
pushes  telecom  company  to  think  differently.  To  propose  short  term  solution,  the  ATM  community  has  proposed  to 
superpose  layers  such  as  IP/ATM/SONET/WDM  whereas  the  IP  community  tries  to  offer  a  Quality  of  Service 
through  the  introduction  of  Int-Serv  and  Diff-Serv  techniques  /1-2/.  For  the  optical  community,  the  integration  of  an 
electronic  IP  router  with  an  optical  cross-connect  (OXCs)  to  provide  the  required  flexibility  offered  by  the  IP 
network  together  with  an  optimised  transport  capacity  offered  by  the  OXCs  has  been  proposed. 

In  this  paper  we  propose  a  new  IP&ATM/Optical  packets/WDM  layer.  Studied  in  the  frame  of  a  French  Research 
program  called  ROM,  circuit  and  packet  switching  techniques  are  mixed  on  a  single  optical  platform  with  adequate 
interfaces  to  support  a  high  quality  of  service.  In  particular,  the  introduction  of  the  optical  technology  required  to 
offer  the  capacity  of  transmission  pushes  naturally  equipment  constructors  to  envisage  a  new  framing  at  the 
transmission  side. 

2.  Optical  circuit  and  packet  switching 

To  cope  with  the  traffic  volume  increase,  the  first  efforts  were  focused  on  optical  cross-connects  to  provide  a  robust 
and  secure  technique  to  switch  wavelengths.  However,  with  the  traffic  profile  evolution,  the  solution  is  probably  in 
the  introduction  of  optical  packet  switching  techniques  /3-4/  in  the  metro  and  in  the  backbone.  Then,  circuit 
switching  will  be  always  mandatory,  to  establish  connections  aiming  to  interconnect  fibers  through  a  network 
topology  in  presence  of  a  stable  traffic  matrix  or  to  provide  leased  links.  Thus,  the  future  optical  network  will 
undoubtedly  mix  circuit  switching  and  packet  switching  techniques  in  MANs  and  WANs. 

3.  IP  and/or  ATM  /Optical  packets/WDM  for  the  backbone 

At  the  backbone  level,  the  proposed  solution  is  based  on  an  optical  platform  capable  to  interconnect  client  layers 
through  electro-optic  edges. 

Two  scenarios  could  be  envisaged.  The  first  one  is  based  on  a  burst  switching  mode.  The  optical  packet,  created  in 
the  edges,  has  a  variable  duration  (to  be  adapted  to  the  IP)  with  a  minimum  dimension  :  the  time  slot.  The  advantage 
of  this  approach  is  to  have  only  one  header  for  one  or  a  group  of  time  slots  opening  the  way  to  large  switch 
capacities.  The  second  approach  could  consist  to  adopt  long  packets  (several  ps  lengths)  taking  benefit  of  the  high 
level  of  aggregation  of  the  traffic  profile.  In  both  cases,  the  header  can  be  transported  on  the  same  wavelength  than 
the  payload  or  simply  carried  on  an  other  wavelength. 

At  the  core  node  level,  the  massive  presence  of  the  WDM  dimension  and  the  low  number  of  neighbors  surrounding 
a  core  node  opens  the  way  to  fast  optical  cross-connects.  The  contention  resolution  is  achieved  thanks  to  the 
statistical  distribution  of  colored  packets.  However,  in  the  edge  node,  an  optical  switch  is  mandatory  to  solve  the 
contention  in  real  time  before  acceding  to  the  electronic  interfaces  bridging  the  client  layers.  Figure  2  describes  the 
core  node  structure  envisaged  with  a  potential  throughput  of  10  Tbit/s  and  a  Packet  Loss  Rate  (PLR)  better  than  10'6 
for  a  0.7  load,  whereas  figure  3  shows  the  structure  of  the  interface  required.  The  structure  of  the  switch  :  a  fast 
cross-connect,  facilitates  the  circuit  switching  since  it  is  easy  to  activate  through  a  wavelengths  partitioning. 

4.  IP  and/or  ATM/Optical  packets/WDM  for  the  metro 

The  metro  is  probably  the  more  critical  case  since  the  traffic  profile  is  sporadic,  close  to  a  self-similar  traffic  profile 
for  the  data,  and  close  to  a  Poisson  traffic  profile  for  the  voice  &  video.  In  addition,  the  WDM  dimension  being  not 
fully  exploited  for  cost  reasons,  a  network  must  operate  with  a  low  number  of  available  wavelengths  at  least  at  the 
beginning.  For  these  reasons,  a  switching  architecture  must  tolerate  4  wavelengths  per  port,  and  small  packets  (< 
1  (as)  to  don’t  impact  the  packet  transport  efficiency  while  maintaining  constant  the  logical  performance.  This 
imposes  the  adoption  of  low  capacity  optical  packet  switching  matrices  (below  320  Gbit/s  :  demonstrated  feasible  in 
SOA-based  Broadcast-and-Select  packet  switching  architectures)  with  a  high  level  of  flexibility  to  adapt  the 
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capacity  to  the  demand  evolution  while  keeping  the  logical  performance  at  a  constant  value.  Figure  4  illustrates  a 
possible  introduction  scenario  in  the  network:  meshed  topologies  at  a  regional  level  and  ring  topologies  for  the  urban 
part.  Figure  5  describes  a  possible  architecture  for  the  metro.  It  has  been  designed  to  be  folly  flexible  in  term  of  bit 
rate  evolution  by  introducing  optical  buffers  and  providing  bit  rate  adaptation  in  the  3R  regenerators.  It  is  basically 
composed  of  a  number  of  parallel  planes,  interconnected  with  a  transverse  plane  aiming  to  reshape  the  traffic  profile 
in  order  to  have  an  additional  flexibility  on  the  load  evolution.  Circuit  switching  can  be  activated  in  one  parallel 
plane  through  a  wavelength  partition  reservation.  Typically,  the  architecture  can  offer  a  capacity  form  40  Gbit/s  to 
1.28  Tbit/s  by  adding  planes  or  by  increasing  foe  input  bit  rate  from  2.5  to  10  Gbit/s  and  a  PLR  better  than  10‘10  for  a 


0.6  load. 


1 


Figure  2  :  Structure  of  the  core  node 
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Figure  3  :  Structure  of  the  edge  node 


Figure  4  :  Introduction  scenario  in  the  metro  Figure  5  :  Structure  of  the  core  node  for  the  metro 

6.  Transmission  requirements 

In  both  introduction  scenarios,  the  transmission  represents  an  important  issue.  Since  optics  suffer  from  a  lack  of 
digital  memory,  for  cost  reasons,  the  synchronization  at  the  bit  level  is  not  recommended  inside  the  optical  network, 
whereas  a  synchronization  at  the  time  slot/packet  level  is  still  mandatory  to  optimize  the  logical  performance  or  to 
simplify  the  electronic  circuitry  dedicated  to  the  control  of  the  node.  It  is  then  vital  to  envisage  an  asynchronous 
packet  transport  with  a  new  framing  (FEC  for  the  backbone)  adapted  to  the  optical  technology. 

7.  Conclusions 

In  this  paper  we  have  described  how  optical  techniques  could  be  used  to  offer  at  the  same  time  capacity  of  transport 
required  and  flexibility  to  envisage  a  progressive  but  rapid  evolution  of  the  node  throughputs.  The  paper  describes 
two  kinds  of  optical  nodes  enabling  circuit  and  packet  switching  connections.  Two  introduction  scenarios  have  been 
proposed  :  one  for  the  backbone  and  a  second  for  the  metro.  A  strong  need  for  a  new  framing  in  the  point-to-point- 
transmission  layer  is  highlighted.  This  paper  clearly  indicates  that  optical  techniques  have  a  great  potential  in  the 
network.,  not  only  for  the  point-to-point  transmission  but  also  in  the  nodes  to  simplify  the  protocol  and  to  converge 
progressively  towards  ATM&IP/Optical  packets/WDM  while  being  compatible  with  QoS  requirements. 
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Introduction 

The  fast  growing  of  Internet  pushes  the  evolution  of  optical  transport  networks.  Wavelength  Division 
Multiplexing  (WDM)  offers  a  new  dimension  of  the  transportation  of  IP  traffic.  By  using  WDM,  the 
transport  capacity  of  optical  fibers  and  systems  expands  greatly.  It  is  more  important  that  WDM  provides 
some  new  features,  i.e.  the  wavelength  routing,  and  some  new  protection  and  restoration  features.  In  order 
to  fully  utilize  these  capabilities,  we  have  to  develop  new  architectures  to  import  IP  traffic  into  WDM 
highways  while  providing  Quality  of  Service  management  in  a  cost  efficient  way. 

The  interaction  of  the  IP  layer  and  the  optical  layer  is  of  much  concern.  Recently  several  new  proposals  in 
IETF  were  introduced  for  IP  over  WDM  [AR,  CH,  BA,  KN,  RS,  WF,  FS].  Multi-Protocol  Lambda 
Switching  (MPLS),  cooperating  with  OSPF  with  optical  routing  extensions,  is  very  promising  to  become 
the  intermediate  layer  between  IP  and  WDM. 

This  paper  focuses  on  the  architectures  for  the  efficient  transport  of  IP  traffic  on  WDM  networks  and 
proposes  a  functionally  separate  intermediate  layer  between  the  IP  network  layer  and  the  WDM  optical 
transport  layer.  First,  we  summarize  the  roles  of  IP  routers  in  the  WDM  optical  switched  networks.  Then 
we  describe  the  definition  of  the  proposed  integrated  slim  layer  for  IP  over  WDM. 

IP  routers  in  the  WDM  optical  switched  networks 

There  are  three  approaches  of  interaction  between  the  IP  layer  and  the  optical  layer  [GH],  The  first  one  is 
the  traditional  overlay  approach,  where  the  IP  routers,  together  with  the  links,  form  a  virtual  topology.  In 
the  second,  IP  routers  have  the  intelligence  of  resource  management  of  both  IP  network  layer  resources, 
e.g.  electronic  buffers,  and  the  WDM  optical  layer  resources,  e.g.  optical  cross-connects,  converters,  etc. 
In  the  third,  IP  routers  are  totally  black  boxes  where  only  optical  interfaces  can  be  seen  from  outside  and 
all  the  resource  management  becomes  the  build-in  function  of  the  IP  routers. 

It  is  argued  that  the  second  approach  outcomes  the  other  two  in  the  aspects  of  better  leverage  the  strength 
of  IP  and  WDM.  In  this  case,  the  IP  routers  play  the  role  of  both  traffic  bearing  and  resource 
management.  Actually  here  the  IP  routers  are  the  functional  combination  of  the  traditional  packet  store 
and  forward  engine  and  the  network  management  agent. 

Integrated  slim  layer  for  IP  over  WDM 

It  is  necessary  to  make  the  intermediate  layer  between  the  IP  layer  and  the  WDM  layer  as  simple  as 
possible,  therefore  the  trend  is  to  integrate  the  functions  of  several  traditional  layers  between  the  IP  layer 
and  the  WDM  layer  instead  of  using  several  overlay  layers  for  different  functions.  It  will  be  beneficial 
that  IP  router  is  responsible  for  all  non-local  management  functions,  including  the  management  of  optical 
resources,  configuration  and  capacity  management,  addressing,  routing,  traffic  engineering,  topology 
discovery,  exception  handling  and  restoration  [CH].  The  proposal  of  the  integrated  slim  layer  is  to 
encapsulate  some  implementation  details  and  provide  a  uniform  interface  to  the  IP  network  layer.  It  will 
localize  some  functions  which  are  closely  dependent  on  the  implementation.  Therefore  it  is  important  for 
the  generalization  of  non-local  management  functions  of  the  IP  router. 

The  functions  of  the  integrated  slim  layer  must  include  but  not  limit  to  the  following. 

L  Framing.  IP  packets  have  to  use  a  suitable  frame  structure  to  be  transmitted  in  the  WDM  channels. 
Both  the  source  node  and  the  destination  node  rely  on  this  function  of  the  integrated  slim  layer  to 
simplify  the  operation  of  IP  layer. 
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2.  Performance  monitoring.  Local  performance  monitoring  ( not  end-to-end  performance  monitoring), 
which  may  highly  depend  on  the  implementation,  is  another  important  function  of  the  integrated  slim 
layer.  Since  the  integrated  slim  layer  is  likely  to  act  as  the  driver  module  of  specified  optical 
connections,  it  will  have  more  detailed  information  about  the  structure  and  the  components  in  the 
optical  cross-connection.  This  makes  it  possible  to  monitor  the  status  of  the  lower  layer  WDM 
system.  It  is  also  the  responsibility  of  the  integrated  slim  layer  to  translate  the  physical  element 
parameters  to  the  required  format  of  status  reports  or  alarms  which  is  understandable  by  the  IP  layer. 
The  operation  of  performance  monitoring  may  be  done  periodically  or  on  demand  upon  receiving  the 
command  from  the  IP  layer. 

3.  Lightpath  maintenance.  When  and  which  a  lightpath  should  be  set  up  or  tom  down  is  the  decision  of 
the  IP  layer,  while  the  action  of  setting  up  or  tearing  down  a  lightpath  is  executed  by  the  integrated 
slim  layer.  This  particular  function  of  the  integrated  slim  layer  is  also  referred  to  as  a  mediation 
device  function,  i.e.  a  vendor  specified  controller  function  [CH], 

4.  Localized  protection  switching.  It  will  be  potentially  advantageous  to  implement  protection  and 
restoration  at  different  levels,  for  example,  the  service  restoration  in  TCP  layer,  the  transport  path 
protection  in  IP  layer.  The  integrated  slim  layer  may  do  some  fast  localized  protection  switching 
without  consulting  the  IP  layer.  In  order  to  maintain  the  consistence  of  the  protection  and  restoration 
operations  at  different  layers,  the  integrated  slim  layer  will  temporarily  lock  the  protection  switching 
command  issued  by  the  IP  layer.  The  timing  of  localized  protection  switching  of  the  integrated  slim 
layer  must  be  strictly  defined.  If  within  the  time  bound  the  protection  switching  is  not  finished,  the 
integrated  slim  layer  must  give  up  immediately  in  order  to  leave  the  protection  function  to  the  IP 
layer.  Localized  protection  switching  may  include  the  switching  from  a  fault  component  to  a  local 
abundant  component. 

Conclusion 

In  order  to  make  the  IP  layer  independent  of  the  specified  implement  of  the  optical  cross-connects,  we 
propose  to  use  an  integrated  slim  layer  to  encapsulate  the  implementation  details.  It  integrates  the 
functions  of  framing,  performance  monitoring,  lightpath  maintenance,  and  localized  protection  switching. 
These  functions  traditionally  belong  to  different  layers.  The  integrated  slim  layer  can  be  implemented  as 
the  driver  module  of  the  optical  transport  system. 
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One  approach  to  providing  the  benefits  of  the  high  data  rates  afforded  by  optics  is  to  attempt  to  im¬ 
plement  traditional  electronic  and  electro-optic  approaches  using  optical  technologies.  In  electronics,  op¬ 
erations  such  as  buffering,  adding  packets  and  dropping  packets,  or  merging  packet  streams,  are  done  with 
ease.  In  optics,  however,  buffering  is  onerous.  Operations  such  as  retrieving  a  packet  from  a  traffic  stream 
affect  the  whole  stream.  The  routing  approach  we  propose  seeks  to  make  use  of  the  strengths  of  optics  and 
avoids  operations  that  are  cumbersome  or  expensive  in  optics.  Our  goal  is  to  provide  robust  and  reliable 
approaches  to  access  at  optical  data  rates.  Our  design  rationale  is  the  following: 

Avoid  buffering.  Buffering  in  the  network,  with  the  attendant  issues  of  cost  and  possibility  of  overflow, 
is  to  be  avoided.  Therefore,  we  avoid  traffic  merging.  Traffic  splitting,  on  the  other  hand,  does  not  require 
buffering. 

Do  not  use  a  switch  unless  necessary.  While  traffic  to  remote  locations  should  be  handled  by  such 
switches,  local  traffic  need  not  be  handled  by  these  switches.  It  is  desirable  to  isolate  local  traffic  from  the 
vagaries  of  congestion  associated  with  external  traffic.  Finally,  switches  are  expensive  and  extensibility  is 
often  difficult.  Thus,  allowing  traffic  that  does  not  need  to  be  routed  through  switches  to  bypass  switches 
avoids  unnecessary  costs  and  dismptions  caused  by  switch  upgrades. 

Use  the  optical  layer  to  provide  recovery.  Our  routing  is  well  suited  to  packet-switched  traffic,  but 
does  not  rely  on  a  specific  protocol  such  as  IP  or  ATM.  Instead,  our  access  scheme  operates  below  such 
protocols.  Recovery  is  performed  at  the  optical  layer,  in  a  manner  which  is  robust,  simple  and  rapid  enough 
that  recovery  at  higher  layers  does  not  need  to  be  triggered. 

We  consider  routings  that  allow  automatic  recovery  from  link  failures.  An  undirected  graph  G  =  (N,  E ) 
is  a  set  of  nodes  N  and  edges  E.  With  each  edge  [x,  y]  of  an  undirected  graph,  we  associate  two  directed 
arcs  ( x ,  y )  and  (y,  x).  Each  arc  corresponds  to  one  of  the  fibers  forming  the  link.  We  assume  that  if  an  edge 
[x,  y\  fails,  then  arcs  ( x ,  y)  and  (y,  x )  fail.  Assume  that  the  graph  G  is  two  edge-connected,  i.e.,  removal  of 
an  edge  leaves  the  graph  connected,  which  is  necessary  to  make  recovery  from  link  failures  feasible. 

We  describe  a  routing  such  that  recovery  is  possible  even  in  the  event  of  any  link  failure.  Our  routing 
consists  of  two  parts:  a  “collection”  portion  and  a  “distribution”  portion.  The  collection  portion  allows 
all  nodes  to  place  their  traffic  on  the  access  wavelength(s)  and  the  distribution  portion  ensures  that  packets 
can  reach  all  nodes.  The  architecture  we  consider  is  the  following:  nodes  place  traffic  on  a  shared  optical 
medium  (we  consider  a  single  wavelength  onto  which  traffic  is  written  and  a  single  wavelength  off  of  which 
traffic  is  read).  Traffic  that  needs  to  be  sent  to  or  received  from  outside  the  local  area  can  go  through  a 
switch,  which  acts  as  any  other  node,  placing  traffic  onto  the  collection  wavelength  and  retrieving  traffic  off 
the  collection  wavelength.  Local  traffic  need  not  be  handled  by  the  switch. 

The  collection  portion  of  the  routing  is  constructed  as  follows.  We  select  a  root  node  and  build  a  depth- 
first  search  (DFS)  numbering  beginning  at  the  root  node.  The  routing  is  a  walk  that  traverses  nodes  as  they 
are  considered  by  the  DFS  numbering  algorithm.  A  collection  routing  traverses  every  node  at  least  once. 
Moreover,  it  traverses  each  edge  at  most  twice.  If  an  edge  is  traversed  twice,  it  is  in  opposite  directions  at 
each  time.  Thus,  each  arc  is  traversed  at  most  once  and  so  a  single  wavelength,  in  both  directions,  is  sufficient 
to  establish  the  collection  portion  of  the  routing.  The  solid  lines  in  Figure  1  show  a  DFS  construction  on  a 
sample  network. 
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Figure  1:  Example  DFS  tree  and  labeling.  The  graph  shown  is  the  NJ  LATA  network.  An  expanded  version  of  node  7  is  labeled 
with  ancestor  (A),  descendant  (D),  and  tree  (T)  markings.  Link  styles  indicate  their  use  in  the  access  collection  recovery  protocol. 

The  distribution  portion  of  the  routing  is  a  directed  spanning  tree  rooted  at  the  last  node  traversed  by 
the  collection  routing.  Call  this  tree  the  primary  tree.  Robustness  is  afforded  in  the  distribution  section  by 
constructing  a  secondary  tree,  which  shares  the  root  but  no  arcs  with  the  primary  tree,  and  such  that  removal 
of  any  edge  (and  its  two  associated  arcs)  leaves  the  root  connected  to  every  node  on  at  least  one  of  the 
trees.  Such  trees  were  first  introduced  in  [1,  4],  using  s-t  numberings  ([2]),  and  a  more  general  method  of 
constructing  them  was  given  in  [3],  A  single  wavelength,  used  in  both  directions,  is  sufficient  to  construct 
the  primary  and  secondary  trees.  The  roots  of  the  primary  and  secondary  tree  must  therefore  be  able  to 
perform  wavelength  conversion,  by  placing  the  traffic  from  the  collection  routing  onto  the  distribution  trees. 
We  only  require  two  nodes  to  perform  wavelength  conversion. 

Our  algorithm  performs  recovery  separately  both  in  the  collection  and  the  distribution  portions  of  the 
routing,  even  though  they  co-exist  over  a  common  physical  infrastructure.  The  crux  of  our  algorithm  lies 
in  our  method  of  performing  link  recovery  in  the  collection  portion  of  the  routing.  Suppose  that  link  [i,j] 
fails.  If  link  [i,j]  is  not  included  in  the  DFS  tree,  then  its  failure  leaves  the  collection  routing  unaffected. 
We  therefore  need  only  consider  the  case  where  link  [i,  j]  is  included  in  the  DFS  tree.  Assume  wlog  that  the 
node  i  is  the  ancestor  of  j.  Failure  of  \i,j\  entails  disconnection  of  all  the  descendants  of  i.  From  the  DFS 
construction  and  the  fact  that  that  we  have  a  two-edge-connected  graph,  some  descendant  of  j  must  have  an 
edge  connecting  it  to  some  ancestor  of  i.  We  have  developed  a  distributed  algorithm  that  uniquely  selects 
one  such  link  and  uses  it  to  reconnect  j  and  its  descendants  to  the  upper  partition  of  the  collection  route.  For 
the  distribution  portion,  each  node  downstream  of  a  link  failure  in  the  primary  tree  switches  to  receiving  on 
the  secondary  tree. 

We  have  extended  this  approach  to  node  failures  and  have  developed  an  algorithm  for  implementing  this 
routing  and  the  associated  recovery. 
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1.  Introduction 

Wavelength  filters  with  large  channel  numbers  and  narrow  channel  spacings  are 
quite  important  in  dense  WDM  systems.  Other  important  application  of  the  filter  is  a 
spectrum  slicing  of  supercontinuum  light  to  make  multiwavelength  source  for  WDM.  This 
paper  reviews  the  current  performances  and  future  prospects  of  arrayed-waveguide  grating 
(AWG)  filters. 

2.  Large  Channel  with  Narrow  Spacing  AWGs 

Crosstalks  in  AWGs  are  attributed  to  the  phase  and  amplitude  fluctuations  in  the  total 
electric  field  profile  at  the  output  side  array-slab  interface  since  the  focused  beam  profile  is 
the  spatial  Fourier  transform  of  the  electric  field  in  the  array  waveguides  [1].  Phase 
fluctuations  are  caused  by  the  nonuniformity  of  refractive  indices  and  core  geometries  in  the 
array  waveguide  region.  By  the  improvement  of  the  waveguide  fabrication  technologies,  the 
crosstalks  of  less  than  -33  dB  have  been  achieved  in  256ch-25GHz  spacing  AWG  as  shown 
in  Fig.  1.  Geometry  and  index  difference  of  the  core  are  4.5  fimx 4.5  fim  and  1.5  %,  which 
enables  the  minimum  bending  radius  to  be  2  mm.  This  AWG  was  fabricated  on  single  wafer 
of  4-inch  Si. 

The  maximum  available  wafer  size  of  PLC  is  limited  by  the  fabrication  apparatus  such  as 
electric  furnace  and  mask  aligner  etc.  Therefore,  it  is  not  so  easy  to  fabricate  AWG  with 
larger  channel  numbers.  One  possible  way  to  increase  the  total  number  of  channels  is  to  use 
two  kinds  of  AWGs,  which  are  cascaded  in  series.  We  made  320ch-10GHz  spacing  AWG  by 
cascading  a  32  ch-lOOGHz  spacing  flat  AWG  as  a  primary  filter  and  a  16ch-10GHz  spacing 
AWG  (10  channels  are  used)  as  a  secondary  filter  [2],  Fig.  2  shows  the  configuration  of  a 
cascaded  multiplexer  to  realize  320-channel  AWG  with  10  GHz  spacing.  Fig.  3  shows  the 
demultiplexing  properties  for  the  entire  320  channels.  Crosstalk  to  the  neighboring  channels 
is  about  -20  dB.  The  transmission  losses  range  from  14  to  21  dB.  Loss  variation  can  be 
reduced  by  employing  a  primary  AWG  whose  pass  bands  overlap  at  lower  loss  value. 
Though  crosstalk  and  insertion  loss  characteristics  are  not  satisfactory,  these  performances 
will  be  improved  by  the  improvement  of  transmission  characteristics  of  each  stage  of 
AWGs. 

3.  Summary 

Large  channel  AWGs  that  are  realized  on  single  wafer  and  with  the  cascaded 
configuration  are  reviewed.  Functional  PLCs  such  as  frequency  spectrum  synthesizer  [3] 
and  coherent  optical  transversal  filter  [4]  will  also  be  presented  in  the  conference. 
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I.  Introduction: 

Future  optical  networks  are  envisioned  to  be  dynamically  reconfigurable  -  the  optical  channels  are 
added/dropped  or  cross-connected  frequently  in  the  optical  layer  using  wavelength-division-multiplexing 
(WDM)  technologies  [1],  For  the  efficient  operation  and  maintenance  of  these  network,  however,  it  is 
essential  to  monitor  various  parameters  including  the  wavelength,  optical  power,  optical  signal-to-noise 
rations  (OSNR’s),  and  optical  paths,  etc.  In  this  paper,  we  review  the  current  status  of  WDM  monitoring 
techniques  and  introduce  the  relevant  research  activities  at  KAIST. 


II.  Wavelength  Monitoring  Techniques 

Recently,  there  have  been  substantial  efforts  to  develop  the  wavelength-monitoring  techniques  for 
WDM  networks  [2]-[7],  For  example,  it  has  been  demonstrated  that  the  wavelengths  of  multiple  WDM 
channels  could  be  monitored  by  using  various  types  of  tunable  filters  such  as  a  PZT-tuned  Fabry-Perot 
filter,  an  acousto-optic  tunable  filter,  and  a  temperature-tuned  etalon  filter  [2]-[4],  However,  these 
techniques  not  only  require  elaborate  tuning  mechanisms,  but  also  have  insufficient  resolution.  To  avoid 
these  tuning  mechanisms,  it  has  been  proposed  to  use  diffraction  gratings  such  as  a  fiber  Bragg  grating 
(FBG)  or  an  arrayed  waveguide  grating  (AWG)  with  a  photodiode  array  [5]-[6],  However,  these 
techniques  could  be  quite  expensive,  as  they  require  a  large  number  of  photodetectors  to  cover  a  wide 
bandwidth  at  high  resolution.  Another  monitoring  technique  utilized  pilot  tones  and  a  fixed  etalon  filter 
[7].  However,  due  to  the  pilot  tones,  the  performance  of  this  technique  could  be  impaired  by  the  cross 
gain  modulation  of  EDFA  and  stimulated  Raman  effect  [8]-[9].  We  will  review  these  techniques  and 
discuss  their  advantages  and  disadvantages. 


III.  OSNR  Monitoring  Techniques 

To  monitor  the  performance  of  WDM  networks,  it  is  essential  to  measure  the  OSNR  of  each  channel. 
Previously,  it  has  been  demonstrated  that  the  OSNR  could  be  measured  by  linearly  interpolating  the  ASE 
noise  level  at  the  carrier  wavelength  [10]-[1 1],  However,  in  a  dynamically  reconfigurable  WDM 
network,  each  channel  may  traverse  through  different  route  and  different  number  of  optical  amplifiers. 
Thus,  in  this  network,  the  OSNR  could  not  be  monitored  accurately  by  the  conventional  linear 
interpolation  technique.  To  overcome  this  problem,  we  have  recently  developed  new  techniques  based 
on  (1)  the  receiver  noise  characteristics  and  (2)  the  polarization  extinction  ratio  of  each  WDM  signal  [12]- 
[13].  We  will  discuss  the  current  status  of  these  techniques  and  future  research  directions. 


IV.  Optical  Path  Monitoring  Technique 

The  WDM  transport  network  requires  reliable  monitoring  techniques  not  only  for  optical  performance 
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but  also  for  channel  identification.  In  particular,  for  the  proper  management  of  these  networks,  it  is 
essential  to  monitor  the  optical  paths  of  WDM  signals  in  the  optical  cross-connects  (OXC’s).  Recently, 
several  techniques  have  been  proposed  for  this  purpose  [14]-[17].  These  techniques  utilized  Iow- 
ffequency  tones  in  the  range  of  0.1  ~  5  kHz.  However,  these  low-frequency  tones  could  propagate  to  the 
next  node  and  accumulate  in  the  amplified  WDM  network.  In  addition,  the  low  frequency  tones  could 
generate  the  “ghost  tones”  due  to  the  slow  dynamic  properties  of  EDFA’  s  [8],  To  avoid  these  problems, 
it  has  been  proposed  to  erase  the  pilot  tones  at  the  outputs  of  OXC’s  using  voltage-controlled  attenuators 

[15].  However,  these  techniques  could  not  be  used  for  the  channel  identification  within  the  entire 
network.  To  overcome  this  problem,  we  have  proposed  to  monitor  the  optical  paths  in  the  OXC’ s  using 
dual  high-frequency  tones  (>100  kHz).  The  two  tones  were  used  to  identify  the  originating  node  and 
individual  channel,  respectively.  Thus,  unlike  the  previous  techniques,  the  proposed  technique  could 
monitor  the  optical  paths  not  only  within  the  OXC,  but  also  within  the  entire  network  and  reduce  the 
required  number  of  pilot  tones  considerably. 


V.  Summary 

For  the  efficient  operation  of  WDM  network,  it  would  be  desirable  to  monitor  the  key  parameters  in  the 
optical  layer.  Thus,  various  techniques  have  been  proposed  to  monitor  the  wavelengths,  optical  powers, 
OSNR’s,  and  optical  paths  in  the  DWDM  networks.  In  this  paper,  we  reviewed  the  current  status  of 
these  technologies  and  discussed  future  directions.  Although  some  of  these  technologies  are  promising, 
it  appears  that  most  of  them  are  still  in  their  early  stages  and  should  be  improved  further  for  the  practical 
deployment  in  the  rapidly  evolving  DWDM  network. 
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WDM  networks  are  becoming  increasingly  complex  as  the  number  of  wavelengths  (channels)  carried  on  a  single 
fiber  increases.  Channels  are  added  and  dropped  at  various  sites,  and  exchanged  between  two  of  more  optical 
networks.  Optical  channel  monitoring  in  a  WDM  system  implies  measurement  and  tracking  of  the  wavelength  of  all 
channels.  This  information  can  be  used  for  network  elements  control,  network  maintenance,  and  system  performance 
monitoring.  At  this  aim,  various  tunable  filters  at  the  wavelength  of  1300  and  1550  nm  have  been  demonstrated1. 
They  are  key  components  in  WDM  apparatus  as  well  as  in  a  wide  range  of  fiberoptic  sensors  and  measurement 
applications.  So  far,  a  variety  of  approaches  ranging  from  the  Pockels  effect  in  LiNb03,  current  injection  in 
semiconductor,  electrooptic  effect  in  liquid  crystals  and  most  recently  microelectromechanical  activated  devices  have 
been  exploited.  In  almost  all  these  proposals  the  filter  linewidth  is  larger  than  a  few  nanometers  and  further  degrades 
with  tuning.  However,  for  most  applications,  including  the  dense  WDM  systems,  filters  linewidths  narrower  than 
0.5nm  and  free  spectral  range  of  tens  of  nanometers  are  required. 

In  this  paper  we  propose  a  simple  and  low-cost  solution  for  tracking  the  frequency  of  WDM  channels,  based  on 
the  thermooptic  tuning  of  Fabry-Perot  silicon  optical  filters.  The  fabricated  structures  exhibit,  thanks  to  a  suitable 
coating  stack  on  both  cavity  sides,  high  finesse  and  narrow  bandwidth.  Moreover,  the  free  spectral  range  is  large 
compared  to  the  channel  spacing,  allowing  the  monitoring  of  one  carrier  frequency  at  a  time.  By  means  of  a  heater 
we  change  the  cavity  refractive  index  and  move  the  transmission  peaks,  thus  scanning  the  WDM  frequency  set.  In 
particular,  in  a  preliminary  prototype  realized  by  using  a  100-|Xm-long  silicon  cavity  we  demonstrate  the  possibility 
to  resolve  up  to  seven  50-GHz-spaced  channels,  with  a  crosstalk  of -10  dB,  at  wavelengths  around  1550  nm.  More 
channels  are  easily  resolvable  by  using  shorter  cavities. 

In  particular,  commercial  silicon  wafers  with  nominal  thickness  of  lOOtlO  pm,  giving  rise,  for  X  =  1 .55  pm,  to  a 
theoretical  free  spectral  range  (FSR)  A/  =  432.3  GHz,  or  about  3.45  nm,  and  with  both  sides  polished  to  obtain  a 
very  good  optical  quality,  were  used  as  etalons.  Using  the  experimental  setup  which  is  described  later  in  this  section, 
we  checked  the  surfaces  characteristics  measuring  the  spectral  response  of  the  wafer  itself  before  realizing  the 
coating  stack,.  The  measured  spectrum  is  practically  identical  to  the  theoretical  one,  so  verifying  the  high  surface 
optical  quality.  The  measured  FSR  results  to  be  a  little  bit  smaller  (3.27  nm)  than  the  expected  one,  so  indicating  a 
real  wafer  thickness  of  105  pm.  Then,  we  realized  on  both  surfaces  of  a  similar  wafer,  by  means  of  e-beam 
evaporation  from  solid  sources,  alternated  X/4  layers  of  silicon  dioxide  Si02  (refractive  index  n/m  =  1.44)  and 
amorphous  silicon  a:Si  (n*,^  =  3.48).  The  thickness  of  such  layers  are  Li„„  ~  270  nm  and  =  1 10  nm  respectively, 
for  a  central  wavelength  X=  1.55  pm.  The  layer  thickness  and  refractive  index  were  checked  by  means  of  alpha-step 
and  ellipsometric  measurements.  The  deposition  process  guarantees  uniformity,  low  roughness  and  good  parallelism 
of  the  multi-layer  mirrors. 

The  spontaneous  emission  ASE  of  an  Erbium  Doped  Fiber  Amplifier  (EDFA)  was  used  as  broadband  light 
source  in  the  1550  nm  region.  The  spontaneous  ASE  emission  of  the  EDFA  extends  in  the  range  1500-1600  nm,  but 
is  acceptably  uniform  between  1540  and  1560  nm.  Its  power  level  is  about  +3  dBm.  A  62.5/125  multi-mode  optical 
fiber  brings  the  light  from  the  EDFA  output  connector  to  the  filter  under  test,  which  is  firmly  mounted  over  a  Peltier 
cell.  A  second  62.5/125  multi-mode  optical  fiber  collects  the  light  transmitted  through  the  filter  and  guides  it  to  an 
Optical  Spectrum  Analyzer  (OSA)  for  spectrum  measurements.  Both  fiber  end-facets  are  a  few  millimeters  distant 
from  the  filter  surfaces,  so  minimizing  the  undesired  interference  effects  in  the  air  gap.  This  choice  implies  a 
collected  power  reduction,  but  the  high  OSA  sensitivity  (-70  dBm)  allows  to  overcome  this  problem.  The  input  and 
the  output  fiber  holder  and  the  Peltier  cell  are  mounted  on  three  micro-positioning  stages,  allowing  a  proper 
alignment  of  the  optical  system.  The  Peltier  cell  is  electrically  connected  to  a  constant  current  generator,  so 
permitting  a  fine  temperature  control.  The  temperature  is  checked  by  a  calibrated  thermistor  fixed  on  file  Peltier.  A 
first  filter  was  made  with  a  single  coating  Si02-a:Si  pair  on  each  side.  The  measured  FSR  is  421.2  GHz,  which 
corresponds  to  3.37  nm,  and  indicates  a  cavity  length  L  =  102.6  pm.  The  measured  3-dB  bandwidth  is  66.25  GHz 
(0.53  nm)  and  it  is  relatively  larger  than  the  theoretical  one.  The  global  finesse  calculated  from  these  two  parameters 
is  6.5,  which  is  about  2.5  times  lower  than  the  theoretical  one.  The  finesse  reduction  implies  also  a  contrast 
reduction,  which  goes  from  19  dB  of  the  simulated  curve,  to  15  dB  of  the  experimental  one.  This  fact  can  be  easily 
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explained,  as  previously  discussed,  referring  to  0l  i  ;  !  i  |  r~j  j  IS 

imperfection  of  the  fdter  and  of  the  coating  layers  A  jfl  |  j  i 

surfaces  (i.e.,  roughness  and  non-parallelism)  and  to  _5 . M . I . || . i . M  4 . i . Jif . 

further  absorption  into  the  coating  layers.  Ii|  |  !iil  Iji  ;  |  /||| 

In  fig.  1  are  reported  the  experimental  and  ^  !:l  |  jjil  /  !;  ;  |  !;\ 

the  calculated  transmission  spectra  of  a  silicon  filter  jg_  ‘10  7f  yV . f . Tfjl . ] . T'jfjj . H't'n . * 

whose  mirrors  are  made  of  two  SiQ-a:  Si  pairs.  The  &  I;  jl  j  /;;  ;\  I  j  il|  i/jil 

comparison  between  theoretical  and  experimental  ;e  ,15  . J . /.lil . j . /  1 1 . j . 

data  shows  that  the  e-beam  evaporation  can  produce  I  /  M  \  j  /  j  |  \  /  (  :  ?  j  i  \ 

coating  layers  of  good  quality.  The  FSR  is  408.7  J  /  j  \\\  /  J  5  \  |  j  /  /j  ;  \  \ 

GHz,  corresponding  to  3.27  nm  and  indicating  a  '20 . / . V" . 'y  "  . . '/ . . v  Aj 

cavity  length  of  about  105  Jim.  The  bandwidth  of  /  \  pN#r  /  i  \  /  |  |  /  \ 

this  filter  is  reduced  to  30  GHz,  or  0.24  nm,  and  the  _25  /, . \  _j . /_| . . j . ! . ]\ . . \ . 

corresponding  finesse  is  about  13.7.  The  maximum  f  \1  /  |  \  I  /  ;  \  / 

contrast  is  22  dB.  The  calculated  spectrum  shows,  ?\..y  \j./  i 

as  expected,  a  smaller  bandwidth  and  consequently  - iM4 - - Tms - i~mo“ . 1552 

a  higher  finesse  of  about  45,  and  a  contrast  of  28  wavelength  [nm] 

dB.  In  this  case  too,  the  real  filter  performance  is 

penalized  by  several  factors  connected  to  the  Fig.l  Experimental  (solid  line)  and  calculated 

coating  layer  fabrication.  However,  even  with  a  fdashed  line]  transmission  spectrum  of  a  4-laver- 

non-ideal  optical  characteristics,  this  filter  shows 

very  interesting  features.  A  more  accurate  coating  fabrication,  that  is  a  larger  number  of  pairs  or  a  higher  coating 
films  quality,  would  produce  higher  finesse  structures.  For  inslnce,  sputtering  or  PECVD  deposition  of  SiQ,  Si3N4, 
and  a:  Si  would  furnish  layers  with  low  absorption  and  low  defect  density. 

A  very  important  feature  of  such  a  filter  is  the  possibility  to  tune  it,  that  is  to  move  the  transmission 
spectrum  with  respect  to  the  wavelength  axis.  We  verified  this  fact,  heating  the  filter  under  test  by  means  of  the 
Peltier  cell  and  measuring  the  transmitted  spectrum  at  different  temperatures.  In  fig.2  are  reported  several  spectra  of 
the  second  filter  above  described,  and  acqiired  at  various  Peltier  cell  temperature.  It  is  evident  how  the  resonance 
peak,  which  at  room  temperature  is  at-  1550.5  nm,  moves  toward  higher  wavelength  values  by  increasing  the 
temperature.  For  a  heating  of  about  32  °C  the  resonance  moves  at  values  beynd  ~  1553  nm.  For  further  temperature 
increases  the  resonance  peak  goes  out  of  the  observation  window  and  another  peak  appears  at  the  left  side,  coming 
from  lower  wavelength  region.  It  is  possible  to  conclude  that  by  means  of  a  moderate  heating,  from  room 
temperature  to  about  55  °C,  we  can  tune  the  filter  in  a  range  which  is  determined  by  the  filter  FSR.  In  this  interval, 

which  is  about  3.0  nm  wide,  we 


resolved  at  least  seven  50-GHz-spaced 
independent  channels,  accepting  a 
crosstalk  between  two  adjacent 
channels  of  about -10  dB1.  Moreover, 
during  the  tuning,  the  resonance  peak 
shape  does  not  change  and  the 
passband  bandwidth  remains  at  about 
0.24  nm  over  the  entire  tuning  range. 
Wider  wavelength  ranges  can  be 
unambiguously  explored  using  filters 
with  larger  FSRs,  i.e.,  with  shorter 
cavity  lengths. 
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Fig.2  Thermal  tuning  of  a  4-layer  coated  FP  filter 
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Summary 

Optical  cross-connect  switch  (OXC)  are  key  components  in  advanced  wavelength  division  multiplexed 
(WDM)  networks.  Integrated  optical  switches  have  been  reported  using  several  mechanisms  and  materials. 
We  have  demonstrated  waveguide-type  thermooptic  switches  which  employ  silica-based  planar  lightwave 
circuits  (PLC’s).  The  crosstalk  of  OXC,  caused  by  a  lot  of  reasons,  is  an  important  parameter  which 
influences  the  system  performance  [1,2].  One  of  the  reasons  is  the  coupling  ratio  (CR)  of  a  3dB  coupler  in 
a  switch  unit.  Due  to  the  fabrication  limitations,  it  is  hard  to  obtain  exact  CR  as  designed.  In  this  paper,  we 
propose  a  scheme  to  achieve  low  crosstalk  even  with  the  variation  of  CR. 

The  design  of  our  optical  switch  fabric  is  a  strictly  non-blocking  matrix  switch  using  dilated  double¬ 
layer  architecture  [3],  The  general  architecture  of  a  4x4  OXC  is  shown  in  Fig.  1(a). 

By  using  the  dilated  double  layer  switch  architecture,  there  is  always  only  one  signal  passing  through  a 
switch  unit.  It  ensures  that  there  is  no  first-order  crosstalk  by  this  switch  architecture  [4],  Every  switch  unit 
consists  of  a  2x2  cross-bar  switch  as  shown  in  Fig.  1(b),  for  the  Thermal  Optical  (TO)  switch  type. 

The  crosstalk  of  the  switch  unit  is  influenced  by  the  directional  coupler’s  CR  as  shown  in  Fig.  2(a). 
From  the  figure,  the  crosstalk  of  a  switch  unit  in  cross  state  degrades  to  -15  dB,  if  CR  has  10%  variation. 
Coupling  Spacing  (CS)  and  Coupling  Length  (CL)  as  shown  in  Fig.  1(b)  of  the  directional  coupler 
determine  the  CR.  The  formula  1  shows  the  relation  between  CL  and  CR  from  the  experiment  result. 
CL_offset  is  related  to  the  coupling  efficiency  by  the  bending  waveguides  connected  with  the  straight  CL. 
EJfjCL  is  defined  as  the  effective  CL.  CR  is  equal  to  50%,  when  CL  =  EffjCL  -  CL_offset. 

CR=(1  -cos(2n( CL + CL_offset)/(4  *Eff_CL)))/2  (1) 

The  effective  CL  is  only  determined  by  CS  as  shown  in  formula  2.  It  behaves  exponentially  to  the  CS 
by  BPM  simulation  as  shown  in  Fig.  2(b).  By  the  calculation,  CR  changes  10%  with  0.1  um  variation  of 
CS,  when  CS  should  be  5um  exactly.  Although  CL  and  CS  of  every  3dB  coupler  are  fixed  by  the 
photomask  design,  CR  is  sensitive  to  the  fabrication  variation.  For  example,  CS  of  directional  coupler  may 
vary  within  sub  micron  during  waveguide  etching  process. 

Eff_CL=eff_CL0*exp(CS/y),  where  eff_CL0  and  fare  process  dependent  parameters  (2) 

Fig.  3  shows  the  histogram  of  CR  of  all  3dB  couplers  in  a  2-inch  square  area  on  one  wafer.  The 
variation  of  CR  is  about  15%,  which  means  crosstalk  of  some  switch  units  may  be  worse  than  -15  dB  as 
mentioned  before.  Fig  3  also  shows  the  histogram  of  CR  among  10  different  wafers.  From  these  testing 
results,  the  variation  of  CR  directly  influences  the  crosstalk  of  thermal  optical  switch. 

In  order  to  lower  the  crosstalk  of  the  switch,  we  can  either  improve  the  fabrication  process  of  the 
directional  couplers,  for  example  to  make  CS  as  accurate  as  possible,  or  we  can  change  the  architecture  of 
the  switch  design.  The  dilated  double  layer  architecture  is  to  lower  the  crosstalk  level  to  the  second  order. 
The  second-order  crosstalk  happens  when  two  signals  with  the  same  wavelength  cross  the  middle  two-stage 
switch  units.  If  all  the  switch  units  are  designed  in  the  same  way,  the  degradation  of  crosstalk  in  single 
switch  unit  due  to  the  variation  of  CR  will  cause  the  double  degradation  in  second-order  crosstalk.  Since 
all  the  second-order  crosstalk  is  from  the  middle  two-stage  switch  units,  we  intentionally  use  different  CL 
of  switch  unit  in  stage  2  and  stage  3.  For  stage  2,  we  use  shorter  CL  design  which  is  related  to  the  crosstalk 
tolerance.  And  for  stage3,  longer  CL  is  used.  Due  to  fabrication  variation,  the  effective  CL  within  a  wafer 
is  different  from  the  designed  CL.  In  our  proposed  design,  the  different  CLs  in  stage  2  and  3  will 
compensate  each  other  for  lower  crosstalk.  Fig.  4  shows  the  simulation  result  of  the  second  order  crosstalk 
with  different  ACL,  where  ACL=  CL-of-stage3  -  CL-of-stage2.  The  second  order  crosstalk  of  the  proposed 
design  is  not  as  good  as  ideal  CR  case,  if  CR  is  within  5%  variation,  But  it  maintains  certain  crosstalk  level. 
We  see  that  the  second  order  crosstalk  begins  to  drop  in  the  proposed  design,  when  CR  is  larger  than  5% 
variation.  For  example,  the  crosstalk  of  a  ACL  =200  um  design  (ideal  CL=1300um)  is  6  dB  less  than  of 


0-7803-6252-7/00/$  1 0.00©2000  IEEE 


47 


ACL  =0  design,  when  CR  varies  within  ±7.5%  which  is  reasonable  within  one  wafer  and  among  wafers. 
From  another  point  of  view,  the  proposed  design  can  tolerate  more  CR  variation  than  the  original  design 
within  certain  crosstalk  limitation.  For  example,  with  the  requirement  of  crosstalk  level  under  -40  dB,  the 
original  design  only  can  tolerate  10%  CR  variation.  But  the  design  of  ACL=200  um  can  tolerate  15%  CR 
variation.  The  range  of  ACL  is  determined  by  the  crosstalk  requirement  and  the  variation  of  CR. 

In  summary,  we  have  reported  that  the  different  CL  design  in  the  middle  two  stage  switch  units 
extends  tolerant  range  of  the  fabrication  of  switch  fabric  under  certain  crosstalk  limitation. 
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(a)  (b) 

Fig.  1  (a)  The  architecture  of  the  non-crosstalk  4x4  OXC  design  (b)Single-MZI  switching  unit  configuration,  CL  is  the 
coupling  length. 


Fig.  2  (a)  CR  vs.  switch  unit  crosstalk  in  cross-state  (b)  effective  CL  vs.  CS 


Fig.  3  the  variation  of  CR  within  one  wafer  and  among  different  wafers  Fig.  4  The  second  order  crosstalk  vs.  the  CL  difference 

between  switches  in  stage  2  and  3. 
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Abstract:  Promising  candidates  for  applying  optical  technologies  to  next-generation  networks  include 
optical  self-healing  for  the  transport  function  and  an  ultra-large-scale  optical-switch  fabric  for  the 
grooming/multiplexing  function  have  been  summarized. 

1.  Introduction 

The  tremendous  growths  of  Internet  traffic  and  the  continually  increasing  demand  for  multimedia 
services  have  generated  a  need  for  increased  transmission  capacity  and  switching-systems  throughput  in 
next-generation  broadband-data  core  networks.  Wavelength  division  multiplexing  (WDM)  offers  easy  and 
cost-effective  capacity-expansion  by  means  of  optical  amplifier  repeaters,  and  it  has  been  widely  applied  to 
relieve  congestion  in  public-transport  networks.  With  regard  to  increasing  switching  throughput,  however, 
the  ultra-high-throughput  optical  switching  systems  under  research  can  only  be  produced  by  using 
complicated  optical-signal  processing  and  large-capacity  optical-data  buffering,  and  the  optical  switching 
systems  seems  to  be  a  long  way  off.  The  introduction  of  optical-switch  fabrics  into  ultra-high-throughput 
switching  systems  therefore  appears  to  be  a  more  immediately  promising  approach  to  reducing  size  and 
total  power  consumption. 

This  paper  presents  our  general  views  on  next-generation  beyond-Tb/s  optical  networking,  including 
optical  self-healing  transport  network  technology  for  next-generation  transport  networks  and  ultra-high 
throughput  optical-switch  fabric  technology  for  Tb/s-throughput  switching  systems. 

2.  Necessary  functions  for  next- 
generation  optical  core  networks 

According  to  our  vision,  a 
beyond-Tb/s-capacity  optical  core 
should  have  two  functions:  1)  a 
transport  function  for  supporting  the 
data  transportation  with  a  reliable 
self-healing  function  and  cost- 
effective  WDM  resource  usage;  and 
2)  a  grooming/multiplexing 
function,  which  sorts  the  various 
services  by  type  and  destination  to 
wide  bandwidth  paths  for  simple  and 
cost-effective  transportation.  From  a 
technical  standpoint,  it  is  practical  to 
limit  optical  technologies' 
application  to  an  optical  self-healing 
for  the  transport  function  and  an 
ultra-large-scale  optical  switch 
fabric  for  the  grooming/multiplexing 
function. 

3.  Optical  self-healing  transport  network 

Figure  1  shows  a  scenario  for  introducing  the  self-healing  function  into  transport  networks.1*  WDM 
point-to-point  systems  are  already  in  commercial  application  use  with  SONET/SDH  protection  capability. 
Due  to  the  strong  demand  for  large-transmission  bandwidths  in  the  use  of  the  Internet  and  multimedia- 
related  traffic,  however,  a  new  transport  network  that  possesses  a  bandwidth  expansion  capability  and  a 
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flexible/open  interface.  WDM  optical  networks  have  the  capability  to  accommodate  multi-Gb/s  and  bit- 
rate/format-independent  demands  as  WDM  channels,  so  they  are  very  promising  if  they  have  self-healing 
functionality. 

There  are  two  approaches  to  producing  optical  self-healing  networks.  One  is  to  configure  a  physical- 
mesh  topology  network  with  optical  cross-connects  (OXCs),  and  the  other  is  to  configure  a  physical  ring 
topology  network  with  optical  ADMs  (O-ADMs  or  WDM  ADMs).  Ring-topology-based  optical  self- 
healing  networks^  are  preferred  as  a  first  step  because  1)  O-ADM  is  more  economical  than  OXC,  2) 
protection  speed  in  ring  topology  is  much  faster,  and  3)  O-ADM  is  more  transparent  to  data  bit-rate  and 
format.  Eventually  though,  in  photonic  networks  of  the  future,  multiple  optical  self-healing  rings  will  be 
connected  by  emerging  large-scale  OXC/photonic  nodes.2) 

4.  WDM  Self-healing  ring-transport  network 

From  the  viewpoints  of  application  requirements  supported,  available  monitoring  technology  (e.g., 
availability  of  network  protection  and  network  management),  and  cost-effectiveness  (e.g.,  number  of 
wavelengths  needed  and/or  required  network  resources  for  a  fully  logically  meshed  demand  pattern),  the 
BWPSR  (bi-directional  wavelength  path-switched  ring)  is  the  best  alternative.  It  is  based  on  bi-directional, 
wavelength-based  protection  and  a  wavelength-based  protection-trigger  self-healing  ring  network.  We 
have  developed  a  prototype  of  B WPSR-based  WDM  ring  network,  and  confirmed  the  network's  protection 
speed  is  as  fast  as  that  of  SONET/SDH  ring  networks 

5.  Ultra-large-scale  switching  systems  with  optical  SW  fabric 

In  beyond-Tb/s-capacity  optical 
networks,  the  Tb/s-throughput 
grooming/multiplexing  function  plays  an 
important  role  in  making  the  network 
simple  and  cost-effective.  State-of-the- 
art  electronics  technology  can  not  easily 
produce  a  Tb/s-throughput  switch  fabric 
that  is  non-optical,  and  this  incapability 
seems  to  be  a  prime  first  target  for  the 
applications  of  optical  technologies. 

Figure  2  shows  an  image  of  an  ultra- 
high-throughput  switching  system  with 
an  optical  SW  fabric.  For  better  network 
resource  utilization,  a  Tb/s-throughput 
optical  switch  is  used  for  WDM  optical 
path  grooming/multiplexing  (OLXC), 

ATM  SW  capacity  expansion,  and  IP- 
router  capacity  expansion. 

We  have  developed  a  prototype  of  a 
photonic  node  by  using  our  proposed 
WD/SD  optical  switch  fabric,  optical 
semiconductor  amplifier  gate  switches,  and  super-cell  switching  technologies.  At  present,  the  prototype 
has  ATM  cell  handling  capability,  but  we  plan  to  expand  the  handling  capability  of  the  optical  channel 
(OCh)  and  the  IP. 

5.  Conclusion 

This  paper  described  the  necessary  functions  of  next-generation  optical  core  networks.  We  believe  that 
promising  candidates  for  applying  optical  technologies  to  next-generation  networks  include  optical  self- 
healing  for  the  transport  function  and  an  ultra-large-scale  optical  switch  fabric  for  the 
grooming/multiplexing  function.  We  have  developed  the  prototypes  of  a  WDM  self-healing  network  and  a 
photonic  node.  Fundamental  technologies  have  been  confirmed  by  evaluating  the  prototypes,  and  we 
found  that  cost-effectiveness  is  one  of  the  main  issues  to  be  addressed  in  the  near  future.  It  is  hoped  that, 
with  advanced  and  cost-effective  optical-switching  technologies,  photonic  networks  will  be  able  to  support 
beyond-Tb/s  networking  in  the  near  future. 

[References]  1)  N.  Henmi  et.  al.,  EURESCOM  P615,  “OADM-workshop”,  p.36,  1998;  2)  Y.  Maeno  et. 
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Introduction  There  is  tremendous  activity  in  burst  mode  routing  and  control  traffic  for  next  generation 
internet  (NGI).  For  many  applications  it  is  desirable  that  data  be  transported  from  one  point  in  a  network 
to  one  or  more  other  points  in  the  least  possible  time.  For  some  applications  the  time  sensitivity  is  so 
important  that  minimum  delay  is  the  overriding  factor  for  all  protocol  and  equipment  design  decisions.  A 
number  of  schemes  have  been  employed  to  meet  this  requirement,  including  both  signaling  based  and 
equipment  intensive  solutions.  This  paper  compares  several  means  available  for  providing  burst  mode 
switching  capabilities  in  a  WDM  optical  network,  and  reports  some  test  results  on  a  novel  hybrid  optical 
network  switch. 

The  Need  For  Speed  In  order  to  appreciate  the  impetus  for  designing  burst  mode  switching  networks  it  is 
useful  to  consider  the  delays  that  are  encountered  in  data  networks.  It  takes  20.5  milliseconds  for  light  to 
travel  from  San  Francisco  to  New  York  City  in  a  straight  line  through  an  optical  fiber,  without 
considering  any  intervening  equipment  or  a  realistic  route.  Many  transmission  protocols  require  that  data 
packets  traverse  the  network  only  after  a  circuit  has  been  established.  The  circuit  setup  phase  in  TCP/IP, 
for  example,  involves  three  network  traversals  (sending  a  SYN  packet,  responding  with  an  ACK,  and 
completing  the  procedure  with  another  ACK)  before  any  data  packets  can  be  sent.  While  this  helps  ensure 
reliable  data  transport  it  also  guarantees  a  minimum  network  delay  of  more  than  80  milliseconds  before 
data  can  be  received.  For  some  applications,  such  as  periodic  transmission  of  sensor  data,  this  may  be  an 
unnecessary  overhead  to  impose. 

Reducing  the  Bottleneck  One  way  to  mitigate  the  impact  of  network  setup  time  on  packet  latency  is  to 
pipeline  the  signaling  messages.  Two  methods  for  accomplishing  these  timesavings  are  presented  in  Ref. 
1,  which  discusses  Cut-at-Confirm  and  Cut-at-Setup.  Both  of  these  methods  allow  the  first  packet  delay  to 
be  reduced  from  two  round-trip  times  to  one  and  one  half  round  trip  times.  The  scheme  has  been 
implemented  and  is  currently  undergoing  field  testing. 

Signaling  Based  Burst  Mode  Transmission  One  paradigm  for  providing  a  burst  mode  switching 
capability  using  conventional  equipment  is  to  send  a  signaling  packet  prior  to  the  actual  data  packet 
transmission,  and  transmitting  the  data  while  the  setup  is  in  progress.  An  interesting  description  of  a 
system  that  uses  this  paradigm  is  presented  in  Ref.  2.  This  system  uses  a  dedicated  wavelength  for 
signaling  control  information,  and  a  Burst  Storage  Unit  in  order  to  buffer  packets  when  necessary.  An 
implementation  friendly  link  scheduling  algorithm  is  used  in  order  to  insure  efficient  utilization  of  the 
WDM  channels. 

A  Hybrid  Approach  Another  system  that  uses  pipelined  signaling  with  data  transmission  to  optical  burst 
switching  was  constructed  for  use  in  our  optical  testbed.  The  system  includes  Network  Interface  Units 
(NIU)  and  Optical  Crossbar  Switches  (OXBS).  In  this  system  data  packets,  which  originate  as  groups  of 
cells  in  a  conventional  ATM  over  SONET  OC-12  network,  are  transformed  into  optical  bursts  at  the  NIU. 
The  bursts  travel  through  one  or  more  OXBS  units  and  exit  the  optical  network  at  another  NIU.  The 
destination  NIU  reformats  the  transmission  back  into  its  original  SONET  format  and  passes  it  on  to 
another  ATM  network.  The  OXBS  has  input  ports  and  output  ports  that  may  be  connected  in  a  one-to-one 
or  one-to-many  configuration,  accommodating  both  point-to-point  and  multicast  operation.  Several  of 
these  units  were  built  for  a  U.  S.  government  project  and  have  been  evaluated  at  Telcordia  Technologies. 

The  design  of  this  system  uses  ATM/SONET  interfaces  to  receive  packets  on  an  inbound  link  to  an  NIU. 
The  packets  are  placed  in  a  fast  memory  buffer,  and  a  request  to  the  OXBS  is  sent  over  a  dedicated 
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signaling  wavelength.  The  request  is  encoded  in  an  OC3  ATM/SONET  cell  which  tells  the  OXBS  to 
establish  a  connection  after  a  few  microseconds.  The  OXBS  can  then  pass  further  setup  information  to 
downstream  OXBSs,  requesting  connections  on  the  path  to  the  final  NIU.  Data  frames  are  sent  through 
the  network  over  these  connections  in  8B/10B  encoding,  which  allows  for  simpler  encoding  and  easier 
synchronization. 


Since  the  efficient  utilization  of  the  OXBS  requires  a  very  fast  optical  switch,  a  characterization  of  the 
switching  speed  was  performed.  The  OXBS  uses  semiconductor  optical  amplifier  technology  for  the 
crosspoint  switching  elements.  The  trace  below  shows  that  both  the  rise  and  fall  times  were  shorter  than 


9  nanoseconds,  indicating  that  a  very  efficient  design  was  employed.  A  short  switching  time  is  very 
important  for  handling  fine  grain  packets  such  as  TCP  ACK  packets,  which  are  only  40  bytes  long.  Also 
shown  is  an  eye  diagram  of  a  OC48  signal  after  passing  through  the  OXBS. 

Pure  Burst  Mode  Transmission  The  alternative  to  signaled  data  transmission  is  data  switching  using 
header  information.  An  example  of  this  type  of  network  is  described  in  Ref.3,  and  was  demonstrated  at 
OFC  2000.  In  this  design  a  data  packet  is  sent  simultaneously  with  a  header  that  contains  routing 
information.  The  headers  are  carried  along  with  the  data,  but  out  of  band,  using  subcarrier  multiplexing. 
At  switch  nodes  inside  the  network,  the  optical  signal  is  sampled  just  prior  to  entering  a  short  fiber  delay 
line.  While  most  of  the  signal  is  being  delayed,  a  fast  packet  processing  engine  determines  the  correct 
state  for  the  switching  fabric,  based  on  the  incoming  header  and  a  local  forwarding  table.  The  switch 
fabric  is  commanded  to  enter  a  new  state  just  before  the  packet  exits  the  delay  line  and  enters  the  switch. 
This  method  provides  the  lowest  possible  latency  for  packet  transmission  and  removes  the  onerous  task  of 
pre-calculating  the  arrival  time  of  burst  mode  data  packets. 

Conclusion  We  have  tested  a  novel  hybrid  system  and  constructed  a  burst  mode  optical  networking 
testbed.  There  are  several  ways  to  accomplish  burst  mode  transmission  in  a  WDM  optical  network, 
including  methods  that  rely  on  signaling  in  advance  of  data  packet  arrival  and  pure  burst  mode 
transmission  of  packets  with  no  a  priori  indication.  This  paper  discusses  two  designs  from  the  literature 
and  also  reports  on  a  novel  hybrid  system  that  was  evaluated.  Finally,  a  testbed  that  offers  true  optical 
burst  mode  capability  is  reported.  This  study  provides  a  foundation  for  evaluating  results  of  various 
methods  that  provide  burst  mode  capability  in  WDM  optical  networks. 
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Abstract:  All-optical  signal  regeneration  techniques  are  presented:  fiber  and  semiconductor  based  devices  are 
addressed,  and  some  3R  signal  regeneration  schemes  are  reviewed. 


Introduction 

Record  terabit/s  point-to-point  transmission  systems  over 
more  than  2000  km  have  recently  been  reported  [1],  based 
on  dispersion  managed  non-linear  transmission  techniques, 
without  need  for  any  regeneration  device,  except  for  simple 
linear  optical  amplifiers.  However,  if  terrestrial  photonic 
networks  featuring  dynamic  routing  capability  are 
considered,  more  sophisticated  regeneration  schemes  could 
be  necessary  in  order  to  compensate  for  signal  quality 
discrepancies  between  high  data  rate  WDM  channels  which 
could  be  routed  at  different  times  over  variable  distances. 
Although  classical  optoelectronic  regenerators  constitute 
an  attractive  solution  for  single  channel,  single  data  rate 
transmission,  it  is  not  clear  whether  they  could  be  a  cost 
effective  solution  in  a  tremendously  growing  data 
bandwidth  scenario.  All  optical  regenerators,  based  on 
components  which  can  process  low  as  well  as  high  bit  rates 
signals  could  be  an  interesting  alternative.  In  this  summary, 
we  briefly  review  2R  (reshaping  repeater)  and  3R 
(retiming,  reshaping  repeater)  optical  regeneration  schemes 
for  high  speed  optical  signal  processing,  based  on  different 
technologies  such  as  fibers  or  semiconductors 

1  Main  signal  degradations: 

Main  signal  degradations  in  fiber  systems  arise  from 
amplified  spontaneous  emission  (ASE)  due  to  optical 
amplifiers,  pulse  spreading  due  to  group  velocity 
dispersion  (GVD),  which  can  be  corrected  through  passive 
dispersion  compensation  schemes,  polarisation  mode 
dispersion  (PMD),  and  finally  non-linear  effects  due  to 
Kerr  non-linearity  contributing  to  signal  distortion  and 
jitter  above  10  Gbit/s. 

2  Basic  principle  of  optical  regeneration: 

2.1  2R  regeneration: 

If  time  jitter  is  negligible,  simple  amplification  and 
reshaping  processes  are  usually  sufficient  to  maintain 
signal  quality  over  long  distances  by  preventing  from  noise 
and  distortion  accumulation.  A  2R  regenerator  mainly 
consists  of  a  linear  optical  amplifier  followed  by  a  data- 
driven  non-linear  optical  gate  (NLOG)  which  modulates  a 
"clean"  (low  noise)  CW  laser  light  source.  If  the  gate 
transmission  versus  signal  intensity  characteristics  yields  a 
kind  of  thresholding  and  limiting  behaviour,  then  signal 
extinction  ratio  can  be  improved  and  ASE  amplitude  noise 
partly  reduced.  In  addition,  the  accumulated  chirp  can  be 
wiped  off. 
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2.2  3R  regeneration: 

When  jitter  accumulation  is  also  a  problem,  e.g.  owing  to 
cross-phase  modulation  (XPM)  in  WDM  systems,  or  pulse 
edge  distortions  due  to  finite  response  time  of  non-linear 
analog  signal  processing  devices  (e.g.  wavelength 
converters),  then  3R  regeneration  might  be  necessary.  Fig. 
1  shows  the  basic  structure  of  an  optical  3R  regenerator 
[2],  which  mainly  consists  of  an  optical  amplifier,  a  clock 
recovery  block  providing  an  unjittered  short  pulse  clock 
stream,  which  is  then  modulated  by  a  data  driven  non¬ 
linear  optical  gate  block. 


Figure  1:  Principle  of  optical  3R  regeneration 


Let's  now  review  some  recent  approaches  for  these 
different  building  blocks. 

3  Regenerator  main  building  blocks. 

The  core  of  an  optical  regenerator  is  a  non-linear  gate 
featuring  signal  extinction  ratio  enhancement  and  noise 
reduction. 

3.1  Fibre  based  gates: 

Kerr-shutter  [3]  and  non-linear  optical  loop  mirror 
(NOLM)  [4]  are  probably  the  first  studied  gates  for 
regeneration  applications.  The  ultra  fast  response  of  Kerr 
non-linearity  is  a  major  advantage  of  these  devices  for  high 
speed  signal  processing.  Using  a  Kerr-shutter,  a  40  Gbit/s 
optical  regenerator  has  been  reported  [3],  But  the  weak 
silica  fiber  non-linearity  requires  typical  peak 
power  x  length  products  of  0.5  W  x  km.  Long  fibre  lengths 
(several  km)  and  polarisation  sensitivity  to  incoming  signal 
are  drawbacks  for  WDM  field  deployment.  Recently,  a 
new  polarisation  insensitive  gate  based  on  self  phase 
modulation  spectrum  filtering  was  reported  [5],  with  a 
good  limiting  behaviour. 

3.2  Semiconductor  based  optical  gates: 

Although  not  as  fast  as  fibre  based  devices,  these  gates  are 
much  more  compact.  One  can  distinguish  between  “  active 
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devices  ”,  i.e.  that  require  an  electrical  power  supply  like 
semiconductor  optical  amplifier  (SOA)  based  devices,  and 
“  passive  devices  ”,  like  saturable  absorbers  [6], 

Among  active  semiconductor  gates,  work  has  been 
focused  on  bistable  lasers  [7]  and  mostly  on  SOA  based 
devices .  First  reported  SOA  based  optical  gates  were 
Sagnac  interferometer  incorporating  one  SOA  as  the  non¬ 
linear  element  [8,9].  However  this  gates  are  much  better 
suted  for  high  speed  OTDM  demultiplexing  [10]  than  for 
3R  regeneration  which  requires  a  gate  with  a  well  shape 
rectangular  window  when  switching  data  pulses  are 
somewhat  large.  Other  high  speed  and  well  shaped  optical 
gates  are  SOA-based  Mach-Zehnder  or  Michelson 
interferometers  [11,12],  which  can  be  data  driven  with 
polarisation  independence  up  to  40  Gbit/s,  with  excellent 
performances  [13],  Main  features  of  interferometric  non¬ 
linear  gates  are  signal  extinction  ratio  enhancement  and 
noise  variance  reduction.  100  Gbit/s  optical  logic  gate 
based  on  an  ultrafast  nonlinear  interferometer  (UNI)  using 
an  SOA  in  a  birefringent  interferometer  was  reported  [14], 
while  feasibilility  of  data  regeneration  at  40  Gb/s  with  this 
gate  was  reported  [15]. 

3.3  Clock  recovery: 

For  optical  3R  regeneration,  a  synchronous  unjittered  short 
pulse  stream  clock  has  to  be  recovered  from  incident 
signal.  Many  solutions  meeting  these  requirements  have 
been  reported,  which  cannot  all  be  reviewed  here.  We  only 
focus  on  polarisation  independent  and  potentially  ultrahigh 
speed  schemes  for  RZ  formats.  All-optical  clock  recovery 
through  optical  injection  of  mode-locked  lasers,  either  in 
semiconductor  [16]  or  Erbium  doped  fibre  [17]  could  yield 
low  jitter,  short  pulse  clocks.  Phase  locked  loop  techniques 
using  a  high  speed  non-linear  optical  element  as  phase 
comparator  [18],  could  also  yield  good  performance  above 
100  GHz.  Recently,  a  self-pulsating  laser  clock  recovery 
was  demonstrated  at  40  GHz  [19,20]. 

4  Some  recent  results  on  optical  regeneration: 

We  shortly  review  here  some  recent  results  on  polarisation 
independent  regeneration  schemes  for  data  rates  above  10 
Gbit/s.  Although  2R  regeneration  looks  very  attractive 
because  of  its  simplicity  and  bit  rate  flexibility,  it  is  not 
clear  whether  it  could  be  used  above  10  Gb/s  because  it 
would  require  devices  with  very  short  transition  time 
response. 

4.1.  3R  regeneration: 

“  All-electronic  ”  signal  processing  is  rapidly  progressing, 
with  recent  demonstration  of  a  40  Gbit/s  optoelectronic 
repeater  using  InP-HEMT  technology  [21],  Several  3R 
optical  regenerator  architectures  have  also  been 
demonstrated.  All  optical  fibre  devices  were  first  reported, 
and  have  shown  now  20  [22],  and  40  Gbit/s  [3]  operation 
capability.  On  the  semiconductor  technology  side,  hybrid 
regenerator  structure  using  optoelectronic  clock  recovery 
stage  and  all  optical  SOA  based  non-linear  interferometric 
gates  have  been  demonstrated  (with  BER  assaisment)  at  10 
[23],  20  [24,25]  and  40  Gb/s  [26],  Cascadability  feasability 
of  up  to  1000  regenerators  in  the  recirculation  loop  has 
been  reported  [27]  with  multistage  interferometric  gates 
(cascaded  non-linear  gates  improve  signal  regeneration 
[28]). 


Finally,  InP  electro-optic  modulators  have  successfully 
been  used  for  polarisation  insensitive  in-line  amplitude  and 
phase  synchronous  modulation  of  soliton  signals  up  to  40 
Gb/s,  enabling  error-free  transmission  over  more  than 
20  000  km  [29]. 

5.  Conclusion: 

High  speed  WDM  photonic  networks  will  probably  require 
signal  regenerators.  Although  significant  progress  has  been 
achieved  these  two  last  years,  it  is  not  yet  clear  to-day 
whether  all  building  blocks  of  a  3R  all  optical  regenerator 
will  be  as  easily  integrated  as  for  an  electronic  device,  with 
cost  effective  targets,  particularly  in  a  dense  WDM  context. 
This  means  that  optical  signal  processing  solutions  should 
yield  superior  or  not  otherwise  available  functionalities, 
such  as  bit  rate  flexibility,  for  example. 
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1.  Introduction 

Nonlinear  Optical  Loop  Mirrors  based  on  Transparent  Semiconductor  Optical  Amplifiers  (TSOA- 
NOLMs)  exhibit  fast  switching  with  a  recovery  time  of  the  order  of  a  few  picoseconds,  cascadable 
switching  functionalities,  low  cross-talk  between  adjacent  channels  and  low  noise  figures  [1,2].  When 
an  SOA  is  biased  such  that  the  optical  gain  exactly  compensates  the  linear  waveguide  loss,  the  SOA 
is,  in  general,  not  transparent  to  an  injected  intense  pump  pulse  since  Two-Photon  Absorption 
(TPA)  and  Ultrafast  Nonlinear  Refraction  (UNR)  respectively  absorb  the  pulse  energy  and  broaden 
its  spectral  width  [3].  In  a  TSOA-NOLM,  following  the  passage  of  the  pump  pulse,  the  SOA  may 
be  considered  to  be  transparent  to  the  probe  pulse  if  the  carrier  density  is  immediately  restored  to 
the  value  existing  prior  to  the  arrival  of  the  pump  pulse.  Hence  the  output  energy  of  the  probe 
probe  pulse  is  independent  of  the  time  delay  between  the  probe  and  pump  pulses.  The  transparency 
carrier  density  is,  however,  significantly  dependent  upon  the  pump  pulse  power  [1].  Furthermore, 
due  to  the  small  net  optical  gain  in  a  TSOA-NOLM,  a  high  pump  pulse  power  is  usually  required 
to  achieve  a  specific  switching  output  energy.  Therefore,  both  the  strong  pump  power  dependence 
of  the  transparency  carrier  density  of  SOAs  and  the  requirement  of  high  switching  powers  would 
seriously  limit  the  utilisation  of  TSOA-NOLMs  in  practice. 

The  pump  power  dependent  transparency  carrier  density  of  an  SOA  was  measured  [1]  using  a 
pump-probe  configuration,  and  in  that  work  the  switching  characteristics  of  a  TSOA-NOLM  were 
also  reported.  In  this  paper,  calculations  predict  that  long  SOAs  (>500/jm)  can  be  employed  in 
TSOA-NOLMs  to  not  only  considerably  reduce  the  pump  power  dependence  of  the  transparency 
carrier  density  of  the  SOAs  but  also  to  significantly  increase  the  transmission  of  the  switches. 

2.  Results  and  Discussions 

To  simulate  the  pump  power  dependent  transparency  carrier  density  of  SOAs,  calculations  are 
performed  of  a  pump-probe  configuration.  An  optical  pulse  model  [4]  that  governs  co-propagating 
strong  pump  and  weak  probe  pulses  in  SOAs  is  adopted.  Moreover,  to  compute  the  switching 
behaviour  of  TSOA-NOLMs,  a  comprehensive  model  [5]  that  determines  the  output  properties  of 
the  Sagnac-interferometer  devices  using  SOAs  as  the  nonlinear  elements  is  utilised.  To  accurately 
describe  the  optical  gains  of  SOAs,  a  detailed  material  gain  model  [6]  is  employed  here,  from 
which  the  unsaturated  optical  gain,  the  differential  gain  coefficient  and  the  carrier  density  linewidth 
enhancement  factor  can  be  computed  as  functions  of  optical  pulse  wavelength,  carrier  density  and 
temperature. 

The  simulated  dependence  of  the  transparency  carrier  density  on  the  input  pump  pulse  energy 
is  illustrated  in  Fig.l  for  different  SOA  lengths.  The  behaviour  of  the  transparency  carrier  density 
shows  good  agreement  with  experimental  measurements  [1].  In  particular,  Fig.l  also  shows  that  the 
pump  pulse  power  dependence  of  the  transparency  carrier  density  can  be  considerably  decreased 
by  employing  long  SOAs.  Two-Photon  Absorption  (TPA)  and  TPA-induced  carrier  creation  are 
dominant  in  determining  the  carrier  density  behaviour.  When  the  pump  pulse  propagates  along 
the  SOA,  these  effects  are  gradually  reduced  because  of  the  reduced  pump  powers.  For  a  long 
device,  the  average  optical  gain  per  unit  length  experienced  by  the  probe  pulses  after  the  arrival 
of  the  pump  pulses  is  lower  in  comparison  to  that  of  the  short  SOA.  A  reduced  dependence  of  the 
transparency  carrier  density  is  thus  achieved. 
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Input  Dump  pulse  enerav  (pJ) 


Figure  1:  The  transparency  carrier  density  as  a 
function  of  the  input  pump  pulse  energy  for  differ¬ 
ent  SOA  lengths.  The  SOA  lengths  for  the  dot¬ 
ted,  solid  and  dashed  lines  are  125^m,  250 ym  and 
500/Jin  respectively. 


Input  pump  pulse  enerav  (eJl 


Figure  2:  The  probe  pulse  energy  transmission  ver¬ 
sus  the  input  pump  pulse  energy  in  a  TSOA-NOLM. 
The  SOA  lengths  for  the  dotted,  solid  and  dashed 
lines  are  125pm,  250pm  and  500pm  respectively. 


Fig.2  demonstrates  the  probe  pulse  energy  transmission  of  TSOA-NOLMs  (ratio  between  out¬ 
put  and  input  probe  pulse  energies)  as  a  function  of  the  input  pump  pulse  energy.  It  is  shown 
that  an  increase  in  the  transmission  with  increasing  the  pump  pulse  energy  and  the  existence  of 
transmission  saturation  at  even  stronger  pump  pulse  energies.  Again  these  features  agree  very  well 
with  experimental  measurements  [1].  It  is  interesting  to  note  that  long  SOA  devices  can  signif¬ 
icantly  enhance  the  probe  pulse  energy  transmission.  The  features  of  the  probe  transmission  in 
Fig.2  occur  as  a  direct  results  of  the  inter  play  between  phase  and  amplitude  differences  between 
two  counter  propagating  signal  pulses  in  the  loop.  A  long  SOA  produces  a  large  phase  change  and 
hence  enhances  significantly  the  probe  pulse  energy  transmission. 

In  conclusion,  an  approach  of  utilising  long  SOAs  in  TSOA-NOLMs  has  been  shown  to  reduce 
the  pump  power  dependence  of  the  transparency  carrier  density  of  the  SOA  and  simultaneously 
enhance  the  probe  pulse  transmission  of  the  device. 

This  work  was  supported  by  the  UK  EPSRC  under  grant  GR/M65717. 
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Multimode  interference  (MMI)  couplers  have  found  wide  application  in  integrated  optical 
circuits.  These  couplers  are  based  on  self-imaging  multimode  waveguides  and  have  been 
shown  to  be  very  effective  in  splitting  and  combining  optical  beams.  Their  advantages 
include  compact  size,  ease  of  fabrication,  low  excess  loss,  good  crosstalk  performance  and 
accurate  splitting  ratios. 

Good  power  balance  and  stable  relative  phases  of  the  MMI  couplers  make  them  ideal  for 
incorporation  into  generalized  Mach-Zehnder  interferometer  (GMZI)  structures.  GMZI 
wavelength-selective  switches,  variable-ratio  power  splitters  and  optimized  switches  have 
been  reported  [1].  However,  the  switching  capability  of  these  devices  has  been  very  limited 
so  far.  The  purpose  of  this  paper  is  to  further  investigate  the  switching  possibilities  of  the 
GMZI  and  to  propose  a  novel  design  for  an  optical  GMZI  switch. 

The  general  layout  of  an  NxN  GMZI  is  illustrated  in  Fig.  1.  Identical  4x4  MMI  couplers  are 
used  as  splitter  and  combiners.  They  are  linked  by  N  arms,  each  containing  a  phase  shifter, 
either  electro-optic  or  thermo-optic  in  nature. 


Inputs  Phase  shifters  Outputs 


Fig  1.  Structure  of  an  NxN  generalized  Mach-Zehnder  interferometer  (GMZI). 

The  transfer  matrix  T  for  the  NxN  coupler  ideally  is  given  by 

=  «.,  (1) 
where  amn  is  the  (field)  amplitude  transfer  coefficient,  and  the  phase  <pmn  is  associated  with 
imaging  an  input  m  to  an  output  n  [1], 

The  overall  transfer  matrix  S  of  the  generalized  Mach-Zehnder  interferometer  can  be  written 
as  follows 
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S  =  TAT  (2) 

where  A  is  the  diagonal  matrix  representing  the  phase  shifts  in  the  arms,  and  T  is  the  transfer 
matrix  for  the  NxN  couplers. 

Since  the  phase  shifting  arms  provide  only  N  degrees  of  freedom,  then  only  N  of  the  total  N! 
number  of  switching  states  can  be  produced  by  such  a  structure.  The  phase  shifts  required  in 
the  arms  for  each  state,  are  given  by 

A  =  r'S.r'  (3) 

where  Sa  is  one  of  the  N  allowable  switching  states. 

However,  for  a  switch  to  be  useful,  it  needs  to  provide  all  possible  switching  states.  For  the 
case  of  the  4x4  switch,  there  are  4!  =  24  separate  states.  The  4x4  GMZI  can  be  modified,  by 
adding  some  additional  logic,  to  enable  it  to  provide  the  complete  range  of  switching  states. 
A  complete  switch  is  shown  in  Fig  2. 


Fig  2.  Proposed  architecture  for  a  4x4  optical  switch. 


Thus  by  the  addition  of  only  four  2x2  switches  (which  could  be  2x2  GMZI  switches)  all  the 
switching  states  are  achieved.  The  major  advantages  of  this  design  are  that  the  losses  are 
uniform  across  the  paths  and  that  the  switching  states  are  achieved  using  only  a  very  small 
number  of  phase  shifters.  External  crossovers  are  avoided.  The  switch  is  rearrangeably  non- 
blocking.  However,  it  is  possible  to  allocate  the  connections  of  the  switch  in  such  a  way  as  to 
reduce  the  probability  of  rearrangement.  The  switch  also  has  some  fault-tolerant  properties. 

In  summary,  a  brief  outline  of  the  switching  capabilities  of  the  GMZI  has  been  given.  This 
study  has  led  to  the  design  of  a  novel  4x4  switch.  This  type  of  switch  has  good  loss 
uniformity  and  requires  only  a  relatively  small  number  of  phase-shifting  or  switching 
elements. 
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We  conducted  perfectly  synchronized  bit-parallel  WDM  data  transmission  over  a  single  optical 
fiber.  The  demonstrated  technique  is  applicable  to  scenarios  where  bit-level  synchronization  among 
parallel  WDM  channels  is  required.  In  bit-Parallel  WDM,  each  bit  of  a  byte  or  a  WORD  is  trans¬ 
ported  in  a  different  wavelength  over  the  optical  fiber.  Bit-parallel  WDM  data  transmission  is 
suitable  for  interconnecting  computers  since  processors  read  and  write  data  in  a  byte-wise  fashion. 
Due  to  group  velocity  dispersion  (GVD),  however,  different  bit-parallel  wavelengths  do  not  arrive 
simultaneously  at  the  receiver.  The  best  accuracy  of  present  methods  [1,  2]  for  compensating  the 
time  skew  is  3 ps  for  2.5 km  fiber,  therefore  30ps  for  25 km. 

The  bit-parallel  WDM  data  is  transmitted  over  a  27.5 km  Corning  dispersion  shifted  fiber  (DSF). 
The  experimental  setup  of  the  demonstration  consists  of  three  parts:  the  input  stage,  the  syn¬ 
chronization  stage,  and  the  output  stage,  as  illustrated  in  Fig.  1.  The  input  stage  involves  three 
narrow  linewidth  CW  lasers  (one  tunable  diode  laser  at  wavelength  Ai  =  1563.2nm  and  two  DFBs 
centered  at  wavelengths  A2  =  1540.9nm,  A3  =  1539.5nm,  respectively)  and  the  DSF  to  introduce 
lAQps  time  skew.  The  narrow  linewidth  lasers  are  appropriately  amplified  with  erbium  doped  fiber 
amplifiers  (EDFAs)  to  provide  equalized  amplitudes  among  bit-parallel  WDM  channels  and  simul¬ 
taneously  modulated  by  an  Li:NbC>3  modulator  with  an  OC-48  non-return-zero  formatted  231  —  1 
pseudo-random  bit  sequence  (PRBS).  Using  only  one  modulator  with  a  rising  time  of  40ps  gives 
an  accurate  timing  alignment  among  all  bit-parallel  WDM  channels  and  a  360ps  flat  overlapping 
range  at  the  DSF  input.  In  addition,  imprinting  the  same  data  pattern  on  all  wavelengths  simulates 
the  worst  case  scenario  for  nonlinear  interactions  among  the  different  channels. 

The  synchronization  stage  uses  an  all  optical  switch:  Terahertz  Optical  Asymmetric  Demultiplexer 
(TOAD),  with  the  bit-parallel  WDM  data  entering  the  input  port  and  a  synchronous  OC-48  pi¬ 
cosecond  pulse  train  entering  the  clock  port.  Upon  incidence  of  a  clocking  pulse,  the  TOAD  opens 
a  small-switching  window  for  all  bits  or  WDM  channels  at  the  same  temporal  location  determined 
by  the  timing  of  the  clocking  pulse.  When  the  small  switching  window  is  placed  in  a  220ps  time 
region  (for  our  particular  setup)  where  all  bits  overlap,  the  different  WDM  channels  are  aligned 
precisely  in  time.  The  output  stage  of  the  experiment  uses  an  array  waveguide  grating  (AWG)  to 
demultiplex  the  different  WDM  channels  exiting  the  output  port  of  the  TOAD.  Optical  filters  are 


Fig.  1.  A  schematic  diagram  of  perfectly  synchronized  bit-parallel  WDM  data  transmission  over  a 
single  optical  fiber 
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Fig.  2.  (a)  BER  of  all  channels  for  bit-parallel  WDM  data  (  □ — Ai ,  A— As,  and-k—X 3)  at  a  switching 
■window  of  28ps,  (b)  BER  of  channel  Ai  at  TOAD’s  switching  windows  of  lOps  (□)  and  28 ps  (A). 


Delay:  35 ps  Delay:  8.1.6  ps  Delay:  185 ps 

A.  1 

\  .  '  1 

| - 1 

Fig.  3.  EYE  diagrams  of  the  channel  Ai  at  different,  sampling  locations.  Here,  the  sampling  time 
delay  values  are  relative  to  the  beginning  of  the  range  where  all  bit-parallel  WDM  channels  overlap. 

placed  both  before  and  after  AWG  to  eliminate  residual  clocking  pulses  and  amplifier  spontaneous 
emission  from  both  SOA  and  EDFAs.  An  0048  SONET  receiver  is  connected  to  a  bit  error  rate 
(BER)  tester  which  measures  the  BER.  of  each  channel  one  at  a  time. 

The  BER.  measurements  for  each  channel  of  the  precisely  aligned  bit-parallel  WDM  data  are  shown 
in  Fig  2(a)  for  a  TOAD’s  switching  window  of  28p.s.  BER.  10“ 10  is  obtained  for  every  channel  with 
a  231  —  1  PRBS.  A  few  dB  power  penalty  is  introduced  when  the  switching  window  is  reduced  to 
IQps  as  indicated  in  Fig.  2(b).  Here  powers  in  BER  plots  are  the  values  measured  at  the  input 
of  the  all  optical  switch.  Wide  open  EYE  diagrams  at  different  overlap  regions  of  the  bit-parallel 
WDM  channels  are  illustrated  in  Fig.  3  to  indicate  that  good  BER  can  be  obtained  at  any  location 
where  all  bit-parallel  WDM  data  overlap.  Here,  EYE  diagrams  were  obtained  by  using  a  10 GHz 
lightwave  converter  and  a  35 GHz  digital  sampling  scope. 

In  conclusion,  we  have  proposed  and  demonstrated  a  novel  bit-level  synchronization  technique  by 
perfectly  synchronizing  all  bit-parallel  WDM  data  transmitted  over  a  27.5 km  single  fiber.  By  sam¬ 
pling  temporally  skewed  bit-parallel  WDM  data  at  a  common  overlapping  range  with  an  all  optical 
switch(TOAD),  the  bit-parallel  WDM  channels  emerge  aligned  exactly  in  time.  Error  free  transmis¬ 
sion  on  all  WDM  channels  was  achieved.  The  working  range,  where  TOAD  samples  all  overlapping 
bit-parallel  WDM  channels,  is  220p.s  for  our  particular  experimental  setup.  For  transmissions  where 
the  time  skew  exceeds  140/js,  traditional  dispersion  compensation  techniques  must  be  employed  to 
contain  the  time-skew  within  2 20/3 .s  and  to  transmit  longer  distances. 

References 

1.  O.H.  Adamczyk,  M.  C.  Cardakli,  J.X.  Cai,  M.I.  Hayee,  C.  Kim,  and  A.E.  Willner,  “Coarse  and  fine  bit  syn¬ 
chronization  for  wdm  interconnections  using  two  subcarrier-multiplexed  control  pilot  tones”,  IEEE  Photonics 
Technol.  Lett.,  vol.  11,  pp.  1057-1059,  1999. 

2.  S.  Shen  and  A.  M.  Weiner,  “Demonstration  of  timing  skew  compensation  for  bit-parallel  -wdm  data  transmission 
with  picosecond  precision”,  IEEE  Photonics  Technol.  Lett.,  vol.  11,  pp.  566-568,  1999. 


60 


FC4.5 

4:45pm-5:00pm 

Penalty-Free  Polarization  Insensitive  Wavelength  Conversion  Using 
Bidirectional  Four- Wave  Mixing  in  a  Single  Semiconductor  Optical  Amplifier 

J.M.Tang,  P.S. Spencer  and  K. A. Shore 
School  of  Informatics,  University  of  Wales,  Bangor, 

BANGOR,  LL57  1UT,  UK. 

Tel:  +44(1248)382618;  Fax:  +44(1248)361429;  e-mail:  alan@sees.bangor.ac.uk 

1.  Introduction 

Four- Wave  Mixing  (FWM)  in  Semiconductor  Optical  Amplifiers  (SOAs)  has  been  extensively 
investigated  for  a  number  of  applications  in  optical  communications.  The  strong  polarization  sen¬ 
sitivity  of  the  mixing  process  is,  however,  a  major  issue  in  the  utilization  of  the  FWM  technique  in 
practical  networks.  Several  schemes  for  polarization  insensitive  FWM  in  SOAs  have  been  demon¬ 
strated  [1-3],  which  utilise  rather  complex  configurations  involving  several  optical  components.  Re¬ 
cently,  we  proposed  an  extremely  simple  scheme  [4]  for  polarization  insensitive  FWM,  utilizing  just 
one  polarization  insensitive  SOA  and  one  pump  wave.  Furthermore,  employing  two  pump  sources  in 
the  scheme,  polarization  insensitive  and  frequency-conversion  frequency-interval  independent  FWM 
[5]  can  be  also  obtained. 

In  all  the  aforementioned  configurations,  polarization  insensitive  SOAs  were  employed  to  achieve 
the  polarization  insensitive  FWM  performance.  In  this  paper,  we  will  show  that  polarization  in¬ 
sensitive  FWM  operation  can  also  be  realised,  using  a  polarization  sensitive  SOA  by  modifying  the 
configuration  presented  in  Ref.  4. 

2.  Polarization  insensitive  FWM  scheme 

The  proposed  polarization  insensitive  FWM  configuration  is  illustrated  in  Fig.l,  which  employs 
one  conventional  SOA  and  a  single  pump  source.  A  pump  wave  at  frequency  uv  and  a  collinear 
signal  wave  at  frequency  u>s  are  injected  into  a  Polarizing  Beam  Splitter  1  (PBS1).  After  passing 
through  the  PBS1,  these  waves  are  split  into  clockwise  and  counter  clockwise  orthogonally  polarized 
beams:  a  pump  field  and  a  copolarized  component  of  the  signal  field  propagate  clockwise  and  the 
other  two  copolarized  components  of  the  pump  and  signal  beams  travel  counter  clockwise.  The 
generated  conjugate  waves  appear  at  both  facets  of  the  SOA.  If  the  ratio  of  the  powers  between  the 
forward  and  backward  pump  beams  is  adjusted  in  accordance  with  the  optical  gains  of  the  SOA  for 
the  TE  and  TM  modes,  a  polarization  insensitive  conjugate  wave  can  be  obtained  after  combining 
at  the  PBS2. 

3.  Performance  analysis 

The  output  conjugate  frequency  of  interest  in  this  scheme  is  2u>p  —  los,  which  can  be  obtained 
through  nonlinear  mixing  between  both  co-propagating  and  counter  propagating  waves.  Based  on  a 
space-averaged  model  [6],  the  conjugate  waves  generated  by  co-propagating  pump  and  signal  waves 
are 

E pMff  =  xPpf  \f~PsG%  cos9r(ujp  -  (1) 

EP6,S6,P6  =  y Ppby/PsGlsinOr(up  -  Us)e^[(^P~^)t+(2kP-ks)z]_  (2) 

where  x  and  y  are  unit  vectors  parallel  to  the  TE  and  TM  modes  of  the  SOA;  Ppf,  Ppb  and  Ps  are 
the  input  optical  powers  for  the  clockwise  and  counter  clockwise  pump  as  well  as  signal  pulses;  Gx 
and  Gy  are  the  SOA’s  gain  for  the  TE  and  TM  polarized  waves;  6  is  the  polarization  direction  of 
the  signal  pulse;  r  is  the  complex  coupling  coefficient,  kj  >  0,  j  =  p,  s  are  wavenumbers. 

The  conjugate  wave  E pf,sf,pf  (Epb,sb,pb)  polarized  in  the  x  (y)  direction  does  not  suffer  from 
the  wave  number  mismatch  effect  [6],  and  travels  clockwise  (counter  clockwise)  along  the  loop. 
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Esf  Epf  Epb  Esb 


Figure  1:  Polarization  insensitive  FWM  scheme:  (a)  layout,  (b)  input  states  of  polarization  and  (c) 
output  spectrum.  PBS:  Polarizing  beam  splitter.  BS:  Beam  splitter. 


Meanwhile,  conjugate  waves  with  the  same  frequency  can  also  be  created  through  nonlinear  mixing 
between  the  counter  propagating  waves.  However,  it  is  sufficiently  accurate  to  neglect  these  conju¬ 
gate  waves  for  an  SOA  device  longer  than,  say,  ~lmm  and  a  frequency  detuning  larger  than  a  few 
hundred  GHz  [6].  If  the  ratio  of  the  pump  powers  between  the  forward  and  backward  pump  beams 
is  adjusted  to  satisfy 


LeL  =  (~IL)3/2 
P p6  G  x 


then  the  output  conjugate  power  from  the  PBS2  is 


(3) 


Pc  =  {PpfG3x  +  PpbGy)Psr2(ujp  -  us).  (4) 

Hence  a  polarization  insensitive  conjugate  power  can  be  measured  from  the  output  port  of  PBS2. 
Eq.(4)  indicates  that  the  proposed  scheme  does  not  compromise  the  achievable  conjugate  power 
and  signal  to  noise  ratio.  The  significant  advantages  of  the  scheme  also  include  extreme  simplicity, 
efficient  reduction  in  noise.  Therefore,  this  approach  would  significantly  broaden  the  potential 
applications  of  the  technique. 

This  work  was  supported  by  the  UK  EPSRC  under  grant  GR/M65717. 
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